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INTER-LINGUAL  SYNTAX  BRACKETT ING  PROGRAMS 


The  work  outlined  in  this  report  stems  from  the 
ideas  of  Dr.  A. S'.  Parker-Rhodes ,  who  constructed  the 
first  syntax  dictionary  for  English  which  appears  as 
Appendix  I. 

Sections  I  and  III  have  been  prepared  jointly  by 
D.S.  Linney  and  R.  McKinnon  Wood,  who  is  responsible 
for  the  final  formulation  of  the  biacketting  Algorithm. 

Section  II  is  entirely  due  to  R. McKinnon  Wood. 

Section  IV  was  done  by  Y.  Wilks,  who  also  carried 
through  the  extension  of  the  governor-dependence 
syntax  dictionary  with  Miss  J3.  Rigby,  (Appendix  II). 


INTRODUCTION 


The  object  of  a  sybtax-bracketting  program  is  to  enable 
a  oomputer  to  analyse  a  sentence  In  the  way  that  a  child  in 
high  school  does  it*  This  activity,  variously  described  as 
'syntax  'in  the  U.  8, A*  and  "analysis'*  in  Great  Britain,  is 
hallowed  by  time  and  the  grammarians*  It  is  usually  believed 
to  elucidate  the  structure  of  a  sentence,  and  uses  as  its 
fundamental  notions  subject,  predicate,  and  the  names  of 
parts  of  speech,  noun,  verb,  adjective,  adverb,  preposition,, 
conjunction.  Various  notations  may  be  ueedj  0.  F.  Hockett 
uses  diagrams j  but  it  is  more  convenient  to  use  brackets.  We 
shall  use  curved  left-hand  and  right-hand  brackets  and  permit 
their  iteration. 

This  process  results  in  a  grouping  or  bracks ttlng  of 
all  the  words  in  a  sentence  which  depends  to  a  large  extent 
on  their  syntactical  and  grammatical  relations:  a  machine 
bracketting  program  must  depend  only  on  these  relations;  for 
a  digital  oomputer  is  not  a  human  being. 

The  importance  of  such  bracketting  procedure  for  mechan¬ 
ical  translation  is  considerable.  Since  the  procedure  depends 
on  the  machine  having  only  syntactical  information  con¬ 
cerning  the  language,  considerations  of  meaning  are  not 
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involved.  An  analysed  or  bracketted  sentence  may  then  be 
thrown  into  a  standard  form  (e.g,  with  the  governor  at  the 
end  of  the  brackot-group)  which  will  simplify  the  operation 
of  a  translation  algorithm.  Further,  the  bracket-groups  will 
usually  be  semantical  units  on  which  semantical  algorithms 
may  be  performed:  and  the  translation  of  such  units  via  a 
semantic  intorlingua  is  more  likely, 

Park3r-Rhodes  (V  writos: 

Lven  when  two  languages  express  themselves  in  an 
apparently  quite  different  way,  the  standard  form  of  their 
sentences  is  often  identical.  Thus  consider 

The  senior  author  was  me  ((tne  senior  author) (me  was)) 
Moi  j'etais  l’auteur  principal  ( (l* principal  auteur) 
moi-j'etais)) 

The  fundamental  notions  on  which  syntactical  analysis 
depends  ais  those  of  substituont  and  syntactic  function,  and 
the  derivative  notions  of  substituent  type  and  participation 
class  as  expounded  and  defined  in  Parker-Rhodes  (9)  and  (10) 
and  in  a  series  of  publications  by  the  C,L, R. U,  :  M.L,  136; 
M,L,  147;  JioL.  154;  aUL,  153’>  The  traditional  grammatical 
categories  are  thus  replaced. 

A  substituent  is  any  part  of  a  sentence,  a  word  or  a 
phrase  or  a  clause  or  the  sentence  itself,  that  is  meaningful. 
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It  Is  difficult  in  general,  to  assort  that  morphemes  are 
substituents  and  only  some  punctuation  marks  oan  be  so  treated. 

What  is  a  correct  machine  analysis  of  a  sentence? 

These  terms  permit  us  to  define  recursively  a  valid 
bracketting  of  a  sentence  as  an  analysis  such  that 

(1)  A  word  is  a  substituent 

(2)  The  set  of  substituents  that  form  the  group  form  a 

valid  bracket  under  a  computable  algorithm:  in  other 
words,  there  is  a  function  jt)  f  >(.»n,  whore  the 
values  of  the  function  are  0  and  '  I,  and  the 
bracket  is  valid,  if  and  only  if  ,  a  ,  =1 

(3)  The  sentence  is  exhibited  as  a  single  substituent. 

This  definition  permits  the  existence  of  syntactically 
ambiguous  sentences,  which  reflects  the  actual  situation  in 
language.  Another  consequence  is  that  a  correct  analysis  of 
the  sentence  must  exhaust  all  the  substituents. 

Apart  from  this  definition,  we  make  the  restriction 
that  valid  brackets  must  consist  of  contiguous  substituents. 
This  may  require  re-erdoring  of  substituents  to  be  done 
prior  to  analysis,  by  unilingual  routines.  This  condition 
derives  from  any  computation  which  proceeds  strategically 
from  the  inside  of  a  sentonce  out,  in  other  words  which  comr- 
paros  substituents  and  groups  them:  it  would  net  hold  for  a 
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target  strategy  which  attempted  to  break  the  sentence  down 
from  tho  outside  in.  Since  the  algorithm  we  have  developed 
is  of  tho  first  kind,  target  strategies  are  precluded. 

Syntactical  Properties: 

The  lattice  framework  of  the  Parkor-Rhodes  Model  la 
used  to  provide  a  set  of  substituent  types,  which  acts  as  a 
combinatorial  frame  into, which  ono  may  sort  words  according 
to  certain  properties.  These  properties  are  not  themselves 
provided  by  tho  model,  but  by  the  language  concerned.  The 
properties  considered  in  this  study  are 

(1)  Boing  a  governor  or  being  a  dependent 

(2)  .Being  initial  and  being  final 

(3)  Concord  or  agreement  with  respect  to  grammatical 
number,  i,e,  being  singular  and  being  plural 

(1)  is  discussed  in  M, L,  147  and  is  related  to  tradit¬ 
ional  grammatical  notions  of  modification, 

(2)  is  tho  fundamental  property  of  the  word-order  of 
a  bracket-group,  and  cf  a  sentence.  It  is  necessarily  the 
case  that  speech,  and  thus  (derivatively)  written  text,  is 
simply  ordered  by  tho  fact  that  it  ie  a  temporal  process. 

It  is  conceivable  that  some  written  language  might  deliber¬ 
ately  ignore  order,  but  it  is  a  fairly  general  fact  that 


5 


languages  do  not  -  thoy  make  use  of  this  order  to  eliminate 
syntactical  ambiguity  and  to  specify  moaning.  Wo  shall 
make  use  of  the  idea  of  precedence  and  demonstrate  its  power. 

Three  obvious  questions  that  suggest  themselves  are 

(a)  Can  the  substituent  occur  in  the  first  place  in  substi¬ 

tuent  typo  i  ? 

(b)  Can  the  substituent  occur  in  the  last  place  in  Substi¬ 

tuent  typo  i  ? 

(c)  Can  it  occur  elsewhere  in  substituent  typ3  i  ? 

It  is  clear,  however,  that  the  information  derived  from 
answers  to  these  questions  would  not  allow  us  to  use  all  the 
relationships  of  precedence  which  may  occur  in  a  bracket 
group  of  more  than  three  substituent  elements.  To  exhaust 
the  possibilities  in  a  bracket  group  of  n  elements  would 
roquire  n  questions.  Since  the  property  of  preceding  is 
defined  to  apply  to  finite  and  infinite  sequences,  it  is 
bettor  to  have  the  properties  of  initially  and  finality  de¬ 
fined  generally  in  terms  of  precedence.  We  define;  an 
initial  substituent  is  one  that  cannot  be  preceded  by  a  sub¬ 
stituent  that  is  not  initial,  A  final  substituent  is  one 
that  cannot  precede  one  that  is  not  final.  First  and  last 
substituents  may  bo  defined  as  those  that  are  preceded  by 

nono  and  that  precede  none  of  the  substituents  in  a  given 
bracket  group.  And  since  we  are  dealing  with  a  simple 
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ordering,  any  two  substituents  will  precede  or  be  preceded 
one  by  tho  other. 

The  characteristic  word-order  of  tho  sentence  in  a 
givon  language,  which  reflects  the  thought  of  the  people 
who  use  it,  will  bo  gi/en  by  the  characteristic  word-order 
possibilities  which  different  substituent  types  have. 

(3)  Concord  in  general  uight  bo  thought  to  be  a 
property  which  consists  in  actual  agroemont,  according  to 
specif  iablo  rulo ,  of  the  shapes  of  signs,  e.g.  endings. 

It  is  true  that  this  is  the  case  with  some  languages,  e.g. 
Gorman,  but  even  hero  it  breaks  down  bocause  of  exceptions. 

In  tho  case  of  English,  tho  singularity  or  plurality  of  a 
word  cannot  be  inferred  from  tho  grammatical  shapo  at  all: 
there  is  a  difference  botween  'person'  and  'persons'  but 
not  betwoon  'sheep'  and  'sheep',  or  'class'  and  'class'. 
V/hothor  a  word  is  marked  as  'singular'  or, 'plural'  depends  on 
the  person  constructing  tho  dictionary  asking  a  quostion 
about  tho  number  of  tho  word,  and  this  does,  in  some  sense 
cloarly  involve  referring  to  a  category  'number',  which  may 
bo  exemplified  differently  in  different  languages,  and  which 
could  veryieasily  be  called  a  semantical  category. 

In  tho  Parker- Rhodos  syntax  dictionary,  M.L.  159, 
Appendix  No.  1. ,  no  concord  participation  class  has  been 
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assigned  to  tho  subetituont  typos,  but  this  is  not  difficult: 
o.g,  conjunct  groups  with  "and"  or  "both. ...  and"  are 
always  plural,  and  a  nominal  O-cliii  is  always  singular* 

Tho  essential  point  about  any  property  that  is  to  be 
of  use  in  the  analysis  of  a  sentence  is  that  tho  dictionary- 
maker  should  be  able  to  assign  a  value  1  or  0,  (or 
perhaps  a  probability  value  lying  between  1  and  0)  to  the 
question,  -  doos  substituent  i  have  tne  given  property  ? 

It  is  only  of  peripheral  importance  whether  or  not  such  a 
proporty,  o.g,  singularity  -  is  "syntactical "  or  "seman¬ 
tical"  under  some  arbitrary  definition.  It  is  clear  that 
tho  properties  wo  are  using  are  close  to  traditional  gram¬ 
matical  properties  and  so  must  bo  called  syntactical;  it  Is 
equally  clear  that  the  person  making  the  dictionary  entries 
is  using  semantical  categories.  This  is  not  surprising  when 
one  realiaos  that  grammatical  concepts,  such  as  noun,  verb. 
subject  predicate .  etc.  have  been  defined  traditionally  with 
referomo  to  semantical  notions,  such  as  naming,  action, 
assertion. 

If  syntactical  properties  were  defined,  as  Tarski  (17) 
does,  as  properties  that  dopend  only  on  the  kind,  shape  and 
order  of  the  signs,  then  it  could  be  argued  that  no  proper¬ 
ties,  such  as  ’being  a  governor*  or  ’agreement  with  respect 
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to  number  or  gender*  are  purely  syntactical:  a  purely 
syntactical  analysis  would  then  depend  solely  on  the  property 
of  word-order:  this  may  be  possible,  but  would,  in  the  case 
of  a  number  of  languages,  placo  restrictions  on  the  strength 
of  any  syntactical  bracketting  algorithm  which  are 
insurmountable. 

From  the  point  of  view  of  the  bracketting  algorithm 
which  wo  have  developed,  any  further  property,  to  which  the 
dictionary-maker  can  assign  an  answer  without  unreasonable 
uncertainty,  may  be  added  to  the  algorithm,  or  replace  any 
soction  of  it,  if  experience  should  show  that  it  is  more 
powerful.  In  this  contoxt  then,  distinctions  between  syntax 
and  semantics  become  uninteresting.  Incidentally,  one  may 
define  a  sentence  puroly  syntactically  -  the  set  of 
substituents  bounded  by  full-stops,  -  whore  'full-stops’ 
includos  question  marks  and  exclamation  marks. 
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SECTION  I. 


The  Brackotting  Algorithm 

The  logical  methods  used  in  the  presentation  of  the 
algorithm  aro  derived  from  Or.  Boole  (1)  and  (2)  and  are 
dovolopod  and  expounded  in  Section  4  of  this  report. 

l)  Wo  will  assume  the  existence  of  a  Syntax  Dictionary 
for  the  language  in  question,  consisting  of  a  list  of  words 
attached  to  an  appropriate  Dictionary  Entry.  This  entry  will 
bo  dividod  into  throe  sections  giving  the  Governor- Dope ndent 
or  G,D.  entry,  the  VYord-Qrdor  or  W,  0.  entry  and  the  Concord 
entry,  respectively.  This  dictionary  will  also  contain 
participation-class  entries  for  the  substituent  types. 

Each  of  those  three  entries  is  in  turn  subdivided  into 
not  more  than  23  units,  and  in  practice  about  13  (chosen  for 
tho  language  in  question);  those  units  correspond  to  each 
of  tho  23  substituent  types  possible  in  the  Parker-Rhodos 
lattice  model.  Each  of  those  units  contains,  coded  in  a 
suitable  number  of  bits,  tho  information  concerning  tho  possi¬ 
bility  of  tho  participation  of  the  word  in  that  substituent 
type, 

(a)  Tho  Govern or- Do pendent  Entry i 


Each  unit  in  this  section  contains  two  bits,  those  being 
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tho  answors  to  the  following  two  quostions:  (l)  Can  the 
word  occur  in  that  particular  substituent  type  as  a  Govornor? 
(2)  Can  tho  word  occur  in  that  particular  substituent  typo 
as  a  Dopondont? 

Tho  answor  is  codod  as  a  '1*  for  ’Yos' ,  and  an 
•O'  for  'No’,  It  is  cloar  that  a  word  with  a  G.D.  unit 
reading  00  can  occur  as  neithor  Governor  nor  Dopondont; 
that  is,  it  cannot  occur  at  all  in  that  substituont  typo. 
Further,  a  word  with  a  G.D.  reading  01  can  occur  only  as  a 
dopondont,  with  a  reading  10  can  occur  only  as  a  govornor, 
and  11  can  occur  as  either. 

For  convonienco  in  reading,  those  two  bits  are  gener¬ 
ally  expressed  as  a  quartal  numbor,  i.e,  00  is  written 
as  C,  01  as  1,  10  as  2,  and  11  as  3.  For  the 

purposes  of  calculation,  and  for  purposes  of  logic  hero, 
tho  two  bits  are  troated  as  .independent  binary  variables, 

(b)  Tho  IVorfl-Ordcr  Entry; 

Each  unit  in  this  section  again  contains  two  bits, 
giving  tho  answers  to  tho  following  questions,  with  tho  same 
binary  coding:  (l)  Can  tho  word  occur  in  that  particular 
substitxxont  typo  as  a  ’Final'  word?  (2)  Can  the  word  occur 
in  that  particular  substituont  typo  as  an  ’Initial’  word? 
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Horo  a  ’final*  word  is  defined  to  to  one  that  may  and  a 
bracket  group  and  may  not  precede  an  'initial'  word,  and  an 
’initial'  word  is  defined  to  bo  one  that  may  start  a  bracket 
group  and  mty  not  bo  procoded  by  a  'final'  word. 

As  with  the  G.D,  ontry,  we  have  four  possibilities  for 
tho  ontry  in  oach  W„o.  unit.  Sinco  we  wish  to  give  effect 
to  the  order  of  procodonco  which  is  tho  fundamental  feature 
of  word-order  in  a  language,  tho  four  possible  ontrios  have 
tho  following  significance: 

00:  The  substituent  cannot  occur  at  all  in  that  substi¬ 
tuent  type; 

01:  Tho  substituent  may  procode  any  other  substituont,  but 
may  not  follow  a  substituont  with  tho  ontry  10; 

10:  The  substituont  may  bo  preceded  by  any  other  substi- 

not 

tuont  but  may/precedo  a  substituent  with  the  ontry  01; 

11:  The  substituent  may  precodo  or  bo  procoded  by  any 
cthor  substituent. 

( c )  Tho  Concord  Bntrv: 

The  number  of  bits  required  in  this  ontry  is  dependent 
on  tho  amount  of  concord  information  present  in  tho  language 
concerned:  some  languages  may  roquire  distinctions  ot  gender, 


mood,  person,  numbor,  etc,  to  bo  reflected.  In  the  caBe  of 
English,  which  we  consider  hero,  two  bits  are  required;  these 
are  tho  answers  to  the  questions: 


(l)  Can  tho  number  (in  the  normal  grammatical  sense) 
bo  plural  in  that  substituent  typo,  or  H  can  it  be 
singular  in  tho  first  or  seoond  person  in  that  substituent 


type? 


Koro  again  tho  ca3o  00  oxcludes  tho  participation  of 
tho  word  and  11  is  ambiguous  (as  is  tho  case  with  ’sheep'). 


It  is  worth  noting  that  a  word  may  bo  plural  in  one 
substituor t  type  and  singular  in  another*.  For  example,  tho 
word  'book'  would  bo  plural  when  used  as  a  verb,  but  singu¬ 
lar  when  usod  as  a  noun. 


A  Valid  Bracket  Group  maj^be  defined  as  follows: 

A  valid  bracket  group  consists  of  two  or  more  substi¬ 
tuents,  which  are  either  words  of  the  language  or  compound 
substituents  representing  other  valid  bracket  groups,  which 
satisfy  throe  separate  rules,  viz,,  tho  Q.D.  Rule,  tho 
W. 0,  Rule  and  the  Concord  Rule, 

Let  a  bracket  group  consist  of  n  substituents,  viz,, 

S1,  Sg,  ,,,  S*  ,,,  8n  where  n  i  2,  The  three  rules  may 
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then  bo  formulated 

(ft)  The  G.  D.  Ru3  o  t 

The  tra-kot  group  ia  valid  under  this  rule  if  and  only 
if:  for  at  least  one  substituont  typo,  corresponding  to  one 
of  the  units  of  tho  G.D.  section  of  tho  dictionary  entry  of 
the  substituonts|  (l)  for  every  substituont,  tho  substi¬ 
tuent  must  occur  as  a  governor,  or  as  a  dependent,  or  both, 

(2)  For  at  loaBt  one  substituent,  tho 
substituent  must  occur  as  a  governor, 

(3)  For  not  moro  than  one  substituont, 
the  substituent  may  occur  as  a  governor  but  not  as  a 
dependent. 

Wo  treat  tho  answors  to  the  questions  used  in  sotting 
up  tho  dictionary  as  propositions,  which  are  true  or  false, 
and  the  dictionary  coding  gives  us  1  or  0,  We  then 
apply  Boolean  propositional  logic  to  the  above  rule. 

Let  tho  entry  for  any  given  one  of  tho  units  in  the 
G.D,  section  of  tho  dictionary  for  substituont  type  S4'  be 

*1  *  ai  • 

1 

Then  Rule  (l)  may  be  formalised  as* 

For  all  i,  (gA  u  d^)*!. 
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Rulo  (2)  nay  "be  formalised  as: 

For  at  least  one  j  y  =»  1 

and  Rulo  (j)  as 

There  is  not  more  than  one  value  of  k  for  which 

(stk  n  <*k)  =*  1  • 

Wo  combine  these  rules  and  simplify,  thus  obtaining  the 
complete  G.Dt,  Rulo:  the  bracket  group  consisting  of  substi¬ 
tuents  ,  S2  ,  ^  is  valid  if  and  only  if 

di  n  d2  0  •••  n  di-i  n  %  n  di+1n  ...  nam  3  1,  (i^i  to  n). 

If  we  write  out  this  formula  in  full  for  n=»3  we  have 
(a^n  a2ng3^u(din  so  d-j  )  u  (g^^ndgna,  _  j 

(b)  The  W.  0,  Rule: 

A  point  of  interest  is  that  whereas  the  G. D.  Rulo  is 
commutative,  (i,e,  no  rogard  is  paid  to  the  order  of  sub¬ 
stituents  in  a  bracket  group),  this  cannot  be  true  of  the 
W. 0,  Rulo,  because  this  is  intended  to  reflect  information 
about  tho  order  of  words  in  a  text  sentence  -  the  order  of 
the  substltuonts  is  now  a  factor.  We  adopt  the  convention 


15. 


that  in  the  representation  of  the  bracket  group  as 

8.^  ,  Sg  ,  . ..  S1  ,  represents  the  last 

substituent,  Sg  the  second  last,  ...  and  8^  tho  first. 

The  bracket  group  is  valid  under  this  rule  if  and  on|r 
if:  for  at  least  one  substituont  typo 

(l)  Tho  last  substituent  must  bo  marked  as  final, 

1  (2)  Tho  first  substituent  must  bo  marked  as  initial, 

(3)  A  substituent  which  cannot  bo  initial  must  not  be 
followod  by  a  substituont  which  cannot  bo  final, 

(h)  A  substituont  which  cannot  bo  final  must  not  bo 

procodod  by  a  substituent  which  cannot  bo  initial. 

We  again  apply  Boolean  algobra,  and  representing  tho 

dictionary  units  in  the  W, 0,  soction  for  substituont  S  as 

^  wo  may  then  formalize  rule  (l)  as  P^  =■  1 

rule  (2)  as  =  1 

rule  (3)  as  for  any  1  f 

l  where  I .  =  1,  P 

and  rule  (4)  as  for  any  k  , 

whore  Pk  =  1,  1^  =>  1, 

Combining  those  rules  and  simplifying,  wo  obtain  the 
comploto  W. 0,  Rulo:  A  bracket  group  consisting  of  sub¬ 
stituents  S^t  s2,  . ..  8.^  8  is  valid  if  and  only  if 
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F1  n ,  n  n  i1+1  n.  i  -  1  (i»i  to  (n-D) 

If  wo  write  this  out  in  full  for  tho  case  n»3  we  hare 

(f-l  n  f2  n  i3  )  u  (  Fxn  i2n  ij  )  =>1 

(c)  The  Concord  Ihilei 

Tho  braokot  group  is  valid  under  this  rule  if  and  only 
i  if  for  any  substituont  type,  either  (l)  every  substituent 

is  marked  as  plural^  or  (2)  every  substituent 

is  marked  as  singular^  or  both.  (Wo  have  determined  that 
first  and  socond  parsons  singular  are  marked  as  plural. 

Thus  in  'I  say’,  both  'I*  and  ’say'  are  treated  as 
plural) , 

If  wo  roprosont  any  givon  dictionary  in  the  Concord 
section  as  P^  S.^  , 

then  wo  may  write  tuo  rule  as 

( 

(p1  n  p2  n  n  n  p.  n  ...  Pn)  u  (s1  ns2  On  nsj,  n 

2)  Tho  Practical  Algorithm 

It  is  ovidont  that  the  computation  of  these  rules,  with 
the  exception  of  tho  Concord  Rule,  presents  practical  prob¬ 
lems  when  considered  for  operation  over  the  complete  string 
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of  substituents  8lf  S^,  , ,,,  8,,  ,  Although  tho 

expression  of  those  rules  appears  simple,  it  is  nocessary 
first  to  find  tho  right  valuo  of  i  before  the  calculation 
can  procoad,  and  it  must  also  bo  borno  in  mind  that  tho  same 
calculation  must  bo  carriod  out  for  each  of  the  substituent 
typos  prosont  in  tho  givon  languages  before  it  is  known 
whothor  a  bracket  group  of  a  given  length  is  or  is  not  valid. 
As  this  form  of  tho  calculation  is  identical  for  each  of  the 
substituent  typos,  using  as  variables  tho  different  units  of 
tho  dictionary,  there  aro  clear  advantages  in  allowing  tho 
calculation  to  procoad  in  parallol,  whore  tho  computational 
possibilitios  of  parallol  calculation  in  modern  computers  can 
be  exploited  fully.  Thus  we  may  calculate  in  parallel  up  to 
1*0  bits  on  tho  EDSAC  II.  or  up  to  960  bits  on  80  column 
punchod  card  oquipmont,  thUB  obtaining  a  capacity  for  hand¬ 
ling  up  to  20  substituent  typos  on  EDSAC  II. 

Logical  operations  on  this  type  of  oquipmont  are  carried 
out  on  two  variables  at  a  time;  to  compute  a^b  Jc  it  is 
nocossary  to  compute  aUb=>x  and  then  compute  xUc,  Wo 
must  thoroforo  find  a  recursive  algorithm,  handling  only  two 
variablos  at  a  time,  but  operating  in  parallol  over  all  the 
substituont  typos,  for  oach  of  the  throo  rules  givon  abovo. 

In  thoso  algorithms,  wo  must  note  that  wo  do  not  know  the 
value  of  i,  whore  thoro  is  a  discontinuity  in  tho  expression 


18. 


( 


of  tho  rulo,  until  wo  got  there, 
(a)  Tho  G.D.  Algorithm: 


Although  this  rulo  is  commutative ,  the  W.O,  rule  is 
not  -  thus  a  combined  rule  will  not  bo.  Wo  will  therefore 
adopt  tho  same  convention  as  that  adopted  for  tho  first 
formulation  of  tho  W.O.  rulo,  and  consider  an  ordered 
bracket  group  ,  S2  ,  ,  SQ  $  whore  1b  the 

last  substituent  and  Sfi  is  tho  first.  We  again  use  tho 
representation  of  any  given  dictionary  unit  ub  d^^  . 


Wo  first  consider  S1  and  and  write  a  truth  table 

for  tho  16  possibilities  givon  by  all  possible  combinations 
of  gj_  i  g 2  ,  d1  ,  and  dg  ,  whioh  will  givo  tho  required 
result 


gl  d3L  g2  d2 

1111 
0  111 

10  11 
0  0  11 

110  1 
0  10  3. 

10  0  1 

0  0  0  1 

1110 
0  110 

10  10 
0  0  10 

110  0 
0  10  0 

10  0  0 

0  0  0  0 


g  d 
®o  o 

1  1 
1  1 
1  0 
0  0 
1  1 
0  1 
1  0 
0  0 
1  0 
1  0 
0  0 
0  0 
0  0 
0  0 
0  0 
0  0 


On  tho  left-hand  side  of 
this  table  wo  list  the  six¬ 
teen  possiblo  combinations 

&  2*  dl*  d2*  a®d 
on  tho  right-hand  side,  the 

desired  values  of  the  two 

functions  which  we  call 

*o  do  • 
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The  function  gQ  is  tho  funotion  required  by  the  G.D.  mle 
for  q  bracket  group  of  only  two  members,  The  function  dQ 
is  ono  which  is  =1  only  when  (l)  (g1u  d^)  n  (g2ud2)  31 

and  (2)  (g]nT’)n  (g2nd2)  =1 
i.e.  roferring  to  the  description  of  the  G.D.  Rule,  gQ  =»1 
^  tho^^eg^ulos  ^  8)3(2  (3)  are  satisf  iod^and  IB 

conditions  for  rule  (3)  have  not  arisen.  We  then  have 
four  possible  outooraes  of  tho  calculation  involving 

8Z  *  di  »  »2  »  and  d2  *  vi2:  d o  =  00»  01»  10» 

11,  Tho  case  00  ari&os  oithor  if  Rule  (1)  is  brokon,  i.e, 
or  S2  or  both  cannot  participate  or  if  Rule  (3)  ia 
brokon,  i.e.  both  S.^  and  S2  havo  entries  such  that 
Bi  ^  di  30  that  both  S1  and  S2  can  only  participate 

as  governors  in  that  brackot  group.  The  case  01  arises 
whon  both  and  S2  are  dependents  only  and  indicates 

that  tho  brackot  group  is  not  valid,  but  might  become  valid 
if  a  furthor  substituent  8^  wore  added  which  could  act  as 
governor.  The  caso  .10  indicatos  that  tho  bracket  group 
is  valid,  but  will  bccomo  invalid  if  a  furthor  substituent 
is  added  which  can  only  act  as  a  governor.  And  the  case  11 
Indicatos  that  tho  brackot  group  is  valid,  and  will  remain 
valid  whon  a  further  substituent  is  added,  either  as  a 
governor  or  os  a  dopondont.  A  deliberate  parallel  has  been 
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V. 


made  hero  between  the  form  of  tho  function  gQ  d0  and  the 
dictionary  unit  g ^  d.^  ,  so  that  we  may  treat  the  result 

of  our  computation  on  8^  and  S2  as  a  dummy  substituent 
with  dictionary  roforonce  gQ  dQ  ,  where  gQ  and  dQ 
have  exactly  tho  samo  offoct  as  a  real  dictionary  entry  g^ 
and  d  ^  • 

Wo  may  thus  roduce  tho  form  of  the  G.  D.  algorithm  from 


Z'  (^2  ®2  d2  *  *  f i  di  *  *  gn  dn  ^  3  1 


to 


[  Z 


|  L^gl  dl  g2  d2}  ]' 


g0  dQ  whore  Z*  »  gQ  ,  and  this  gives 
tho  roquirod  recursive  algorithm. 


Wo  now  roquiro  to  roduce  the  truth  table  given  above  to 
two  calculable  expressions,  one  for  gQ  and  one  for  dQ  . 

Wo  first  oxpross  tho  truth  table  in  Hilbert’s  Exceptional 
Normal  Form: 

go  3(g^dl  g2°d2^  u  *1*2*2^  u 

U  (gfHp  g2*d2  ^  U  <*ifWV  U  ^ginding2°d2^  U 

U  (g£id1Dg^i<^)  u  (^d£ig^d2). 

Simplifying  by  ropeatod  application  of  tho  formula 

0  x  U  0  *  *  0 

'  * 


21. 


we  obtain  gQ  =>  (d^pg^dg)  uCg^Hg^d^  U 

U  ^gfdl°ff2^  U  (Si^i^dg)  U  (d£ig^id2)  U 

U  (g1r'g2nd2')  u  (®£)  dlng2^  -' 

=(dln®2)  u  («ind2)  U  (g-jHdg)  u  (d^Ogg) 

=  (d.jng2)  u  (g1na2) 


Similarly 

do  “^gindl0g2nd2)  °  ^gindir^2^fl2^  U  ^1  di'‘S20d2^  U 

U  (5*  di  S2  d2^ 

=’(d1ng2nd2)  U  (giraind2)  U  u  (flinS2nd2J 

=  (d1nd2)  u  (dfd2) 

-  dl% 

Wo  may  thon  state  the  0, D-  algorithm  in  its  final  fora* 
Givon  a  string  of  substituents  ,  S2  ,  •  8n  ,  with 

dictionary  entries  g^  d^  ,  g2  d2  ...  g^  dn  ,  for  any 
given  substituent  type, 

Compute  g0  =»  (d.pg2)  u  (g-jpdg)  and  dQ=  d^dg 
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If  g0=»l  (  Sj.  S2)  is  a  valid  brackot  group  under  the 

G.  D.  rule. 

If  dQ  =»1  and  gQ  »0  then  (S1  8g)  is  not  a  valid 
brackot  group  undor  tho  G. D,  Rule,  but  (0-^  Sg  S^)  might 
bo  valid. 

If  gQ  and  dQ  are  both  =0,  S.^  and  8g  cannot 

belong  togothor  in  a  bracket  group  of  that  substituent  typo. 

Unloss  gQ  and  dQ  are  both  =0, 

Computo  gj  =  (d^g^)  U 

d'  -  <3^d, 
o  o  3 

If  gj,  =»1  (S1  Sg  S^)  is  a  valid  brackot  group. 

If  d'^  =1  and  g^  =0,  (S1  Sg  S^)  is  not  yet  a  valid  bracket 

group,  but  might  bo  come  so  when  is  added. 

If  Sq  “0  and  ct^*  =0  ,  S^,  Sg,  S^,  cannot  all  throe 

bolong  togothor  in  a  valid  bracket  group  of  that  substituent 
typo. 

Unless  g'^  and  d^  are  both  =0, 

computo  g*p  -  (a-Hg^)  u  (gona^)  ,  d‘0  =  d0°d4 
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Continue  until  some  sot  n  of  substituonts  8^* «...8Q 
forms  a  valid  braoket  in  some  substituent  type* 


Wo  now  apply  tho  same  technique  to  the  W. 0,  rule  given 
before.  We  must  first  note,  howevor,  that  wo  have  the  two 
rules  givon  for  tho  W, 0,  rule,  viz*  that  F1  =*1  and  IQ  =»1, 
Wo  cannot  fit  both  of  these  conditions  folicitously  into  e 
rocursivo  algorithm,  in  particular,  if  wo  start  with  substi¬ 
tuent  as  wo  did  for  the  G.D.  algorithm,  then  wo  cannot 
fit  this  condition  =1  into  a  recursive  algorithm,  for  it 
only  applios  to  S1  ,  We  will  thus  ignore  this  rulo  for  the 
qiomant  and  add  it  to  the  algorithm  later. 


Wo  construot  a  truth  table  as  before,  which  satisfied 
rulos  (2),  (3)  and  (4)  of  the  Word-Order  algorithm,  and 

is  rocursivo. 


F  I  F  I  FI 

1111  11 

0  111  0  1 

10  11  11 

0  0  11  0  0 

110  1  0  1 

0  10  1  0  1 

10  0  1  0  1 

0  0  0  1  0  0 

•  1110  10 

0  110  0  0 

10  10  10 

0  0  1  0  0  0 

1  1  0  0  0  0 

0  1  0  0  0  0 

1  0  0  0  0  0 

0  0  0  0  0  0 


I 
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Wo  may  oxprosa  this  as 

F0  »  (  p^i^ng  u  (p^n^n^)  u  (  pOin pny  u 

u  (P^^gfUg) 

=  (P1np2m2)  u  (Plnilr>2)  u  (p^n^)  u  (pnp^y 

-  F^Pg 

ar.d 

v  (Ffifip^ig)  u(P1ni1npgi  I2)  U  (P1rtnpnl2)  u  (p^ni^nig) 
u  (0jifip2ni2)  u(pinJir>2nr2) 

=  din»2ni2)  U  ^iVV  U  U  (^\%) 

u  (p^i^ig)  u  d1r>2nE2)u  (  p^p^i^ 

-  d1ni2)  ^  (Fini2) 

=■  v^Fiu^; 

Wo  may  no w  givo  tho  final  form  of  the  word-ordor 
algorithm  adding  tho  condition  that  F,  »  1  t 

Given  a  string  of  substituents  S1  ,  S2  ,  ,,,  , 

with  dictionary  entries  F,  I.  .  F„  I.  .  F„  I  .  for 

A  JL  d  d  D  Dr 


any  given  substituent  type,  where  8^  is  the  last  substituent 
in  tho  string  and  8^  the  first. 

If  P,  »0,  (8^  ,  8g  ,,,)  oannot  form  a  valid  bracket  group. 

If  P  »  1, 

i 

Compute  I0  =  lg  n(  F^UE^) 

and  a.  P,  H  p  ,  Then 

*r4*,V 

if  Io=«  1,  (8^  ,  8g  /  is  a  valid  bracket  groupA  the  word- 

ordor  rulo. 

If  PQ  =»1  and  IQ  *0,  (8^  ,  Sg  ,)  is  not  a  valid 

brackot  group,  but  (8^,  8g,  8^,)  might  bo  valid. 

If  I0=»  0  and  %  »0,  8-^  and  Sg  oannot  belong  together 

in  a  bracket  group  of  that  substituent  type. 

Unless  Io  and  Fo  are  both  »0, 

Compute  I0'  a  I3  n(PQ^0)  an*  P^  -  P0  n  then 

If  =»  1,  (8^,  8g,  8^)  is  a  valid  bracket  group. 

If  1&‘  »  0  and  Fp  al,  (S^,  Sg,  8^)  is  not  a  valid 

bracket  group,  but  (8^  8g,  8^,  8^)  now  might  be  valid. 

If  I^  aO  and  Fp  »0,  then  8^  ,  8g  ,  8^  cannot  be¬ 

long  together  in  a  brackot  group  of  that  substituent  type. 

Unless  I'J  and  Fp*  are  both  *0 
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Computo  i”  -  I^n(  F 'qSj  I0‘  ) 
ana  P*  =  . 

Continuo  until  for  some  set  of  substituents  8,  ,,.*8^ 

l  ’  n 

a  valid  brackot  is  formed. 

(c)  Tho  Concord  Algorithm; 

The  form  of  the  Concord  Rule  is  such  that  a  recursive 
algorithm  is  not  strictly  necessary.  As  wo  wish,  however, 
to  make  a  combinod  algorithm  of  all  three  miles,  it  iB  con¬ 
venient  to  design  tho  conoord  algorithm  in  this  form.  It 
can  oasily  bo  verified  that  a  recursive  algorithm  with 

8  =>  S,  0  8. 

o  1  2 

and  PQ  =>  P^Pg  (*here  ’S’  here  means  'being  singular' 
and  'P'  means  'being  plural'),  satisfies  the  concord  rule 
we  havo  given.  Thus  for  any  set  of  substituents  (A-^  Ag)  if 
either  SQ  or  P^  »1  thon  (A^  Ag)  form  a  valid  brackot 
group;  and  if  both  S4  and  P6  are  =0,  thon  A^  and  Ag 
cannot  appoar  in  a  bracket  group  of  that  substituont  type, 

(d)  Tho  Combinod  Algorithm: 

Wo  now  combino  tho  throo  algorithms  given  before.  First 
wo  noto  that  tho  failuro  of  any  of  tho  three  causes  tho 
brackot  group  to  be  rejected.  In  other  words 
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if 

*0 

and 

do 

are 

both 

=>0 

or 

if 

Po 

and 

xo 

are 

both 

»0 

or 

if 

Po 

and 

CO 

o 

are 

both 

=0 

or 

if 

pl 

is 

=*o, 

tho  brackot  group  fails. 

Thus  tho  bracket  group  is  rejected  if  and  on2y  if  the 
following  holds: 

Fau  (g0na0)  u(f0  ni0)  u(  P0n  s0  )  =1 

or  F1n(guao)n(F0uxo)n(  P0Us0  )  =o  . (1)  ana 

furthor  repetition  of  tho  algorithm  to  add  further  substi¬ 
tuents  will  also  fail.  Thus  this  gives  the  stop  rule. 

For  tho  bracket  group  to  bo  valid  as  it  stands,  all 
throe  rule 8  oust  be  satisfied,  that  is  wo  must  have  the 
following  gQ  =1 

and  IQ  »  1  , 

and  cither  P0  or  £U  =1.  Tho  brackot  group  is  thus  valid 
if  and  only  if  tho  following  holds: 

«onIo°'pouSo  >-1 . <2> 

In  all  other  cases  the  bracket  group  is  not  yet  valid, 
but  may  form  part  of  a  largor  brackot  group  which  will  bo 
valid,  and  tho  algorithm  must  bo  repoated  to  bring  in  tho 
next  substituent. 

Substituting  in  (2)  above  for  gQ ,  IQ,  and  80,  wo 
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have  the  following  as  tho  necessary  and  sufficient  condit¬ 
ion  for  tho  validity  of  a  brackat  groups 

c(d£g2)  u  (g£id2)  3  nci2n  )3  n  H^nSg)]  »i 

or  (d1ug1)n  (ajjn2)  n (g2ugl) n  (b2^2) nign (p^ix)  n 

n  (P3ys1)  n(p1LB2)n  (P2uSi)n  (p2us2)  =1 

and  substituting  in  (l)  above  wo  havo  the  following  as 
tho  no coss ary  and  sufficient  condition  for  the  failure  of  a 
brackot  group  (omitting  tho  special  starting  condition  of 
*1  =  1) 

CCdjnggJu  (g-jOd^u  (djOdg)  ]  n  i  (f£>f2)  u(ijni2)  u 
u  (p1m2)  ]  n  [(P^nPg)  u  (b£)82)j  »o . . 
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SECTION  2 


The  Strategy  of  a  Syntax  Progi'ammo 

We  have  given  a  definition  of  a  valid  bracket  group, 
in  terms  of  Dictionary  information,  and  we  have  obtained  a 
complete  algorithm  for  computing  whether  or  not  any  string 
of  substituents  will  form  a  valid  bracket  group.  We  must 
now  consider  the  types  of  stratogy  to  be  employed  in 
recursively  applying  this  algorithm  to  a  sentence,  so  as  to 
obtain  a  bracketting  analysis  of  the  sentence  such  that  the 
complete  sentonco  is  exhibited  as  a  substituent,  or  valid 
bracket  group.  Wo  will  illustrate  the  application  of 
differing  strategies  to  a  common  English  sentence  form: 

((ab)((ci)  (e(fgh) ) ) ) 

corresponding  for  example  to  the  sentence  'The  man  was 
sitting  on  the  park  bonch*. 

We  can  first  distinguish  between  two  broad  classes  of 
strategy.  The  first  is  ’  the  Target  stratogy  where 

we  ^  first  attempt  to  find  the  largest,  or  outer¬ 

most  bracket  grfcups,  then  split  each  of  these  into  their 
innor  groups,  and  so  on.  Using  the  example  above,  we  would 
first  look  for  (ab) (odo^feh) ,  then,  assuming  this  division 
to  be  corroct,  to  split  (cdofgh)  into  ((cd)(efgh)),  and 
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thon  ((cd) (o(fgh))). 

In  tho  second  strategy,  we  look  first  for  the  inner¬ 
most  bracket  groups,  and  thon  combine  them  into  success¬ 
ively  larger  groups  until  tho  whole  sentence  is  contained 
in  a  single  group.  Thus  wo  might  first  find  the  group 
(f gh) ,  thon  combino  this  with  the  word  ’o*  to  get 
(o(fgh)),  find  (cd)  and  combine  it  with  (e(fgh))  to 
got  ((cd) (o(fgh)))  and  thon  finally  combino  thi  with 
(ab)  to  got  tho  comploto  sentonce. 

If  it  woro  tho  caso  that  tho  brackotting  process  was 
entirely  unambiguous,  that  is,  that  in  no  caso  could  a 
bracket  group  bo  formed  which  would  not  bo  incorporated  in 
tho  final  brackotting  of  the  complete  sentence,  then  son- 
tenco  analysis  would  present  no  difficulties,  and  there 
would  bo  little  to  choose  botwoen  those  two  types  of 
stratogy.  This  would  imply  for  example  that  (gh)  or 
(of)  could  nevor,  in  any  context,  bo  a  valid  bracket  group. 

The  fact  that  this  is  not  the  caso  causos  both  typos 
of  stratogy,  if  simply  appliod,  to  run  into  difficulties, 

Tho  manner  in  which  this  breakdown  occurs  appears  to  be 
different  in  tho  two  typos  of  strategy,  but  is  in  fact 
basically  tho  some.  In  tho  oaso  of  tho  Target  strategy, 
tho  sontonoo  is  exhibited  as  a  bracket  group  from  the  start, 
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and  wo  look  for  the  oonstituonts  of  this  bracket  group. 

Thoro  will  in  practico  bo  a  number  of  difforont  possible 
constituents,  and  each  of  those  in  turn  must  bo  broken  down 
into  its  oonstituonts,  of  whioh  thoro  may  again  bo  a 
numbor  which  aro  possible ,  Each  assumption  concorning  tho 
oonstituonts  of  any  particular  brackot  group  loads  to  a 
troo  of  furthor  brackot  groups  and  further  assumptions  con¬ 
cerning  thoir  oonstituonts,  all  of  which  appear,  to  tho 
machino  at  loast,  to  bo  difforont,  and  all  of  which  have  to 
bo  followod  up.  Wo  cannoc  know  which  branch  or  branches  of 
this  troo  will  provo  to  bo  corroct  oxcopt  by  discovering 
thoso  which  ovontually  load  to  failuro,  which  thoy  do  by 
exhibiting  a  brackot  group  with  no  possiblo  oonstituonts. 

This  will  not  nocossarily  happen  until  tho  lovol  of  tho 
words  to  tho  santonco  is  roachod. 

In  tho  case  of  tho  second  strategy,  on  which  tfce  prev¬ 
ious  computer  programme  of  tho  C.L.R.U, ,  M.L.  136,  was 
basod,  tho  breakdown  occurs  in  an  apparently  very  difforont 
way.  By  looking  first  for  tho  innermost  brackot  groups,  that 
is,  groups  formed  from  tho  sontonco  words,  and  thon  success¬ 
ively  combining  them  into  largor  groups,  wo  can  bo  cortain 
that  any  grfcup  formod  is  valid,  and  that  its  constituents 
aro  also  valid,  Howovor,  whon  attempting  to  form  tho  groups 
wo  will  In  practico  havo  a  numbor  of  choioos.  Thus  in  tho 
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oxamplo  abovo,  wo  might  find  that  not  only  is  (fgh)  a 
valid  group,  and  this  loads  to  (o(fgh)),  but  also  (of) 
and  (gh).  If  thon  ((of)(gh))  is  not  a  valid  group,  wo 
can  discard  (of)  and  (gh),  but  it  may  well  occur  that 
tho  lattor  group  is  also  valid,  and  wo  might,  pursue  it 
successfully  up  to  tho  ond  of  tho  sontonco,  only  to  find 
that  tho  last  bracket  group  containing  tho  wholo  sontonoo 
cannot  bo  forraod.  As  in  tho  caso  of  tho  Target  stratogy,  wo 
again  obtain  a  troc,  oach  branch  of  which  is  formed  by  tho 
different  possibilities  oi‘  combining  a  given  sat  of  substi¬ 
tuents  into  valid  brackot  groups.  And  again,  we  may  not  bo 
able  to  discard  a  branch  until  tho  analysis  has  boon 
comploted, 

Givon  tho  existonco  of  this  problom,  thoro  a^  basic¬ 
ally  two  sorts  of  tactics  which  can  bo  omployed  -  that  of 
parallol  computation  and  that  of  sorios  computation*  Thoso 
apply  equally  to  tho  two  strategics  described  abovo.  Wo 
must  accopt  that  wo  shall  have  to  computo  brackotting  possi¬ 
bilities  that  will  lator  provo  to  bo  unacceptable,  and  wo 
must  docido  whotnor  to  computo  those  in  parallel,  keopipg  in 
tho  storo  all  tho  possible  brackottings  obtainod  and  follow¬ 
ing  up  all  tho  branohos  of  tho  troo  at  oach  stago,  or 
whether  to  computo  in  series.  In  this  lattor  caso,  wo 
chooso  at  each  stago  one  only  of  tho  brackot  possibilities, 
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and  thus  follow  up  only  ono  branch  at  ono  time.  If  tho  pro- 
coduro  falls,  wo  thon  track  back  up  tho  branch,  chooso  a 
difforont  bracket  possibility,  and  follow  down  a  difforont 
branch. 

In  tho  provious  C.L. R. U.  Progratnmo,  wo  adopted  sorios 
computation,  Tho  small  sizo  of  tho  thon  EDSAC  II  storo  made 
this  method  most  convenient,  and  it  also  gavo  us  tho  opportu¬ 
nity  of  studying  tho  offocts  of  a  priority  ordoring  on  tho 
brackctting  possibilities.  At  every  stage  in  the  computation, 
when  wo  aro  combining  a  sot  of  substituents  into  a  bracket 
group,  wo  may  havo  a  choico  of  combination.  Wo  may  chooso 
any  ono  on  an  arbitrary  basis,  or  wo  may  havo  built  in  an 
ordor  of  priority  of  choico  which  dotorminos  which  combina¬ 
tion  is  usod  first  for  further  computation.  This  procoduro 
is  equivalent  to  tho  procoduro  usod  with  targot  stratogios 
of  assigning  difforont  ’oxpoctations'  to  certain  typos  of 
brackot  group,  that  is  for  example,  of  ’expecting'  tho  first 
group  of  a  statement  to  bo  a  nominal  group  acting  as  tho 
subjoct  of  tho  sontonco.  Tho  offect  of  such  an  ordor  of 
priorities  is  to  minimize  tho  amount  of  computation  on  a 
probabilistic  basis,  A  priority  rule  is  not  absolute,  a  low 
priority  group  may  in  fact  bo  tho  correct  ono  for-  that  parti¬ 
cular  son  to  now,  cut  in  gouortu.  wo  can  suy  that  an  acceptable 


solution  will  too  produood  with  loss  baok- tracking  than  would 
too  tho  oaso  if  tho  choico  woro  arbitrary. 

In  a  sorios  method  of  computation,  it  is  of  course 
nocossary  to  koop  a  rocord  of  all  tho  choicos  whioh  have  boon 
rnado,  that  is,  of  all  tho  bronchos  of  tho  troo  which  havo 
boon  follovvod  up,  so  as  to  too  suro  that  tho  same  computat¬ 
ion  is  not  ondlossly  ropoatod.  This  was  offoctod  in  tho 
previous  C.L.R.U.  Programme  toy  tho  dovico  of  an  80  bit 
'Choico  Troo'  numbor.  In  this,  all  tho  possiblo  paths  of 
tho  computation  woro  mappod  on  to  a  binary  troo,  in  which 
oach  fork  roprosontod  a  binary  choico.  Ab  soon  as  tho 
choico  was  mada,  tho  corresponding  bit  in  tho  choico  troo 
numbor  was  sot  to  '1',  thus  oxcluding  that  choico  in  tho 
programme  should  that  fork  bo  rovisitod  as  a  rosult  of 
back- tracking.  It  was  possiblo  to  obsorvo  tho  amount  ol 
back- tracking  taking  placo,  by  printing  out  tno  80  bits  of 
tho  choice  troo  numbor.  Experiments  with  this  rovoalod  a 
sorious  difficulty,  which  applies  generally  to  any  sorios 
method  in  this  f iold.  This  .troublo  was  manifested  thraigh 
tho  programme  outputing  a  largo  numbor  of  apparontly  identi¬ 
cal  partial  brackottings  of  a  sontonco,  whioh  could  howovor 
bo  proved  to  havo  boon  oach  obtainod  by  a  difforont 
calculation. 


Any  sorios  mothod  for  oxhausting  a  sot  of  ohoioos  can 
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bo  comparod  to  tho  solution  of  a  mazo.  Wo  may  solve  a 
mazo  by  a  sot  procoduro  which  provonts  us  taking  tho  samo 
path  twioo.  Wa  may  thon  oxhaust  all  poesiblo  paths.  Thus 
whon  wo  roach  a  fork,  if  unmarked  wo  mark  it  and  taka  tho 
loft  path,  if  alroady  marked,  wo  tako  tho  right  path.  If 
ovary  blind  alloy  has  only  ono  path  loading  to  it,  as  is 
true  of  a  roal  mazo,  than  as  wo  travol  oach  path  only  onco 
wo  also  ond  up  at  oach  blind  alloy  only  onoo.  If  howovor 
thoro  aro  moro  than  ono  path  to  any  givon  blind  alloy,  thon 
wo  aro  liablo  to  arrivo  at  that  blind  alloy  onco  for  oach 
path.  Furthormoro,  as  wo  have  no  way  of  knowing  what  sot  of 
paths  load  to  tho  samo  ond-point  (if  wo  know  this  tho  mazo 
would  bo  alroady  solvod),  and  wo  can  only  mark  tho  paths, 
wo  havo  no  way  of  provonting  oursolvoB  from  roaching  tho 
samo  ond-point  a  numbor  of  times.  From  tho  point  of  viow 
of  a  Programme  thon,  wo  havo  no  way  of  provonting  tho 
ropoatod  occurronco  of  partial  brackottings  which  wo  al¬ 
ready  know  will  fail,  and  tho  programme  will  taka  a  vary 
groat  doal  longor  to  oorao  to  its  conclusion  than  would  havo 
boon  oxpoctod  from  tho  numbor  of  possible  brackets  which 
could  bo  formed* 
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To  illustrato  this,  consider  tho  ohoico  troo  bolowj 


Tho  outcomo  points  A,B,C-  -  -  H  roprosont  bracket- 
tings  or  partial  brackottings  of  tho  sentence.  Wo  can 
oxhaust  all  tho  possiblo  paths,  and  thus  oxhaust  all  tho 
possible  outcome  points  in  a  givon  order,  say  A,  thon 
B,  C,  up  to  H,  and  wo  will  roach  each  point  onco  only. 

It  will  in  goneral  bo  tho  case  howovor,  that  sovaral  points 
will  reprosant  tho  s  atno  partial  brackotting.  Thus  points 
A,  C,  F,  and  H  might  woll  all  correspond  to  tho  samo  out¬ 
put,  oven  though  they  aro  arrived  at  by  difforont  paths  in 
tho  programme. 

Thero  appears  to  be  little  doubt  that  a  syntax  pro¬ 
gramme  is  of  tho  form  of  a  maze  whore  any  given  outcome  may 
bo  roachod  by  a  number  of  difforont  paths,  and  thoro  seems 
to  bo  no  roason  to  supposo  that  this  difficulty  would  bo 
any  difforont  if  a  target  stratogy  were  adopted, 

(*A.  F,  Parkor-Rhodos  is  currently  investigating  a  form  of 
Targot  Stsatogy  which  may  partly  ovorcoraa  this  difficulty  )  • 
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This  tnon  is  an  important  disadvantage  of  the  sorios  method. 

It  is  worth  horo  examining  ono  of  tho  advantages  of 
tho  sories  method.  This  is  that  it  allows  ono  to  obtain  a 
corroct  brackotting  of  a  sontonco  without  exhausting  all  tho 
possibilities.  In  a  parallel  method,  all  tho  possiblo 
brackottings  will  be  followed  up,  and  tho  output  of  tho 
programme  will  contain  all  thoso  brackottings  which  Burvivo 
to  tho  end.  In  a  sorios  method,  oven  with  an  arbitrary 
choico  procoduro,  one  complete  output  will  usually  appear 
boforo  all  tho  paths  havo  boon  exhausted,  and  with  an  effi¬ 
cient  priority  order,  an  accoptablo  brackotting  output  may 
bo  achieved  very  quickly.  Wo  may  then  be  contont  to  stop 
and  rogard  this  output  as  tho  final  rosult.  Wo  should  hero 
considar  howovor,  tho  distinction  botwoen  a  brackotting  which 
is  corroct  to  tho  machine,  that  is,  obeys  all  tho  rulos  of 
tho  programme,  and  a  brackotting  which  would  bo  accoptablo 

to  a  person  with  a  good  knowlodgo  of  tho  language.  That 

« 

thoso  two  concepts  of  ’corroctnoss  of  a  brac)ro t-t-ing-  aro 

«  correct 

not  identical  is  shown  by  tho  number  of  machine^  but  obvi¬ 
ously  absurd,  brackottings  which  can  bo  produced. 

It  is  clearly  tho  aim  of  experiment  to  reduce  tho 
divorgonco  botwoon  thoso  two  concepts  of  correctness  as  far 
as  possiblo,  and  to  do  this  with  a  minimum  of  dictionary 
information  and  computor  routines.  On  doing  tosts  on  a 
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particular  programme,  with  particular  algorithms  and  diction¬ 
ary  information,  wo  aro  concornod  with  comparing  maohino 
corroct.  solutions  with  intuitively  corroct  solutions  and  in 
tosting  the  offoct  of  further  information  in  reducing  the 
divorgonco  botwoon  those.  For  this  purpose,  wo  aro  not  only 
intorostod  in  tho  first  machino  corroct  solution  producod, 

^  but  in  all  possible  machino  corroct  solutions.  Only  in  this 

case  can  wo  be  certain,  for  any  given  sentence,  that  tho 
apparont  succosa  of  tho  programme  was  not  due  to  soma  possi¬ 
bly  accidental  effect  of  a  priority  ordor,  ard  that  the 
succossful  result  was  not  Just  ono  of  a  large  number  of  ab¬ 
surd  results,  or  on  tho  other  hand  that  in  an  unsuccessful 
rosult,  tho  corroct  result  would  not  havo  been  obtained  at 
thosocond  try.  Further,  tho  amount  of  information  obtainod 
from  any  ono  tost  is  greatly  incroasod  when  all  possible  out¬ 
puts  aro  obtainod,  and  it  bocomes  loss  important  to  tost  such 
{  vary  largo  bodios  of  toxt. 

Finally,  tho  exhibition  in  tho  output  of  all  tho 
possiblo  brackottings  will  onablo  us  to  tost  the  discrimina¬ 
ting  power,  as  botwoon  those  sovoral  brackottings,  of  other, 
not  necessarily  syntactical,  rules.  Wo  could  then  tost  tho 
powor  of  somantic  procedures  applied  after  tho  syntactical 
analysis,  without  having  to  ombed  these  prooodures  in  the 
aotual  programme.  For  oxporimontal  purposes  at  any  rate, 
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thoro  than  saara  to  bo  strong  advantages  in  favour  of  a  pro- 
coduro  for  exhausting  all  brackotting  possibilities,  and  in 
this  caso  tho  serios  method  is  not  accoptablo, 

Tho  very  consider  able  increase  in  tho  size  of  tho  core 
storo  of  ED8AC  II  has  now  made  it  practicable  for  us  to  use 
a  parallel  computation  mothod,  which  will  provide  as  output 
all  tho  possiblo  maohiijo  correct  braokettings  of  a  sentence, 
and  work  is  proceeding  at  prosont  on  a  programme  to  do  this, 
using  tho  algorithms  described  in  Soction  I,  Before  start¬ 
ing  on  this  programme,  wo  considered  tho  possibility  of  using 
a  binary  brackotting  parallel  procedure,  such  as  has  been 
studied  olsowhore.  [  Privato  communication  from  M.  Kay  of 
tho  Rand  Corporation]  ,  In  this  typo  of  procedure,  it  is 
assumed  that  all  brackots  are  binary  -  that  they  contain 
only  two  constituents.  It  is  thon  possiblo  to  start  at  one 
end  of  a  sentcnco,  comparing  tho  substituents  in  pairs  and 
adding  any  binary  brackot  thus  found  . to  a  list  whioh  is  also 
takon  in  pairs  with  tho  remaining  substituents.  At  any 
stago,  a  brackot  either  is  or  is  not  valid,  as  there  are 
only  two  constituents,  and  any  brackets  which  prove  to  be 
invalid  aro  droppod  from  tho  list  and  havo  no  further  effect, 

Tho  C,L, R.U.  Syntax  Theory  does  not  dofino  a  valid 
bracket  as  on?  that  is  neoessarily  binary,  and  such  a 
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dof inition  is  incompatible  with  our  definition.  But,  if 
such  a  rule  as  o.g, ,  "soak  to  find  only  binary  bracket 
groups",  wore  added  to  our  algorithm,  this  would,  in  cortain 
casos,  lead  to  diff ioultios,  where  (S^,  S2  ,  8^  )  form  a 

valid  bracket  under  our  algorithm  but  neither  (8^  ,  82  ) 
nor  (Sg  ,  8j  )  nor  (8^^  ,  8^  )  form  a  valid  bracket.  If 
wo  do,  novortholoss,  attompt  to  add  suoh  a  rule,  the  follow-* 
ing  disadvantages  are  apparont.  First,  thore  aro  a  number 
of  brackot  groups  occurring  in  language  which  do  not  fall 
oasily  into  a  binary  form,  A  conjunct  group  ('boys  and 
girls’)  is  ossontially  ternary,  and  can  only  be  compressed 
into  a  binary  form  by  adopting  some  arbitrary  conventions. 

The  samo  is  truo  of  groups  such  as  'time  after  time’,  'one 
by  ono '  ,  A  socond  objection  is  that  a  binary  brackotting 
programmo  using  parallol  computation  does  not  really  got 
ovor  tho  branching  difficulty  which  occurs  with  sorios 
computation,  as  described  abovo,  Ono  way  of  expressing 
this  difficulty  would  bo  to  say  that  it  causes  the  pro¬ 
grammo  to  produco  a  number  of  offoctivoly  identical  results, 
which  aro  arrivod  at  in  difforont  ways,  A  binary  brackotting 
programmo,  whon  applied  to  language  whioh  is  not  in  fact 
binarily  brackottod,  produoes  very  much  tho  same  problem. 

To  tako  an  oxample,  a  simple  nominal  group, 

'a  dark  blue  hat' 
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Wo  would  wish  to  bracket  this  as  '(a  dark  blue  hat)'  • 

A  binary  programme  will  find  all  tho  following: 

((a  dark)  (bluo  hat)) 

(a(dark(bluo  hat))) 

(a((dark  blue) hat)) 

((a(dark  bluo))hat) 

(((a  dark) bluo) hat) 

So  far  as  tno  machine  is  concarnod,  thoso  arc  all 
difforont.  So  far  as  wo  aro  concarnod,  tho  programme  has 
mado  an  entirely  arbitrary  distinction,  which  must  bo  re¬ 
moved  in  some  way  or  tho  numbor  of  maehino  correct  brackets 
tings  producod  will  booome  unmanageably  largo.  This  appears 
difficult  to  accomplish  without  running  directly  into  tha 
branching  problora  which  a  parallel  computation  method  is 
dosignad  to  avoid. 

In  offoct,  wo  must  perform  an  additional  computation 
which  involvos  finding  tho  longest  bracket  groups  (those 
with  tho  maximum  numbor  of  substituonts) ,  which  aro  equiva¬ 
lent  to  tho  binary  brackets  producoci  by  tho  binary  bracket- 
ting  programme. 

Wo  thus  considorod  it  nooossary  to  ombark.  on  a 
parallel  computation  programmo  which  makes  no  restriction 
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on  tho  number  of  substituonts  in  a  braokot  group,  and  whioh 
can  also  oxploit  tho  computational  advantages  of  tho  rocur- 
sivo  algorithm  givon  in  Section  I.  Thic  programme  will  use 
tho  second  strategy  described,  that  of  finding  tho  inner 
brackets  first,  and,  as  with  tho  previous  programmo,  will 
start  at  the  end  of  tho  sontonco.  Wo  start  at  tho  end  of 
tho  sontonco  for  tho  same  roason  that  a  programmo  using  a 
targot  stratogy  starts  at  tho  beginning  of  a  sentonoo  -  it 
is  an  ompirical  fact  that  in  any  language  tho  normal  brackot- 
ting  structure  is  ono  in  which  tho  brackets  cluster  up  at 
tho  ond,  Thus  ( (ab) (c(d(of ) ) ) )  ?  s  common,  while 

((((ab)c)d) (of))  is  uncommon. 

As  Yngvo  (21)  has  pointod  out,  thoro  appears  to  bo  good 
roason  for  this,  in  torms  of  the  memory  capacity  required 
for  a  givon  depth  of  brackotting  using  a  targot  strategy. 

In  too  programmo  stratofcy  wo  aro  using,  finding  tho  innor 
brackots  first,  wo  can  savo  storage  spaco  if  wo  find  a  com- 
ploto  innor  brackot  group  quickly,  and  it  is  thus  more 
efficient  to  start  at  tho  ond  of  a  sontonco, 

Tho  procoduro  of  this  programmo,  which  determines  tho 
ordor  in  which  substituonts  aro  comparod  by  the  algorithm 
and  made  into  brackot  groups,  is  unfortunately  very  much 
more  complex  than  that  whloh  oan  bo  used  in  tho  case  of 
binary  braokots.  In  tho  latter  case,  any  putative  bracket 
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group  con  only  havo  two  mombore,  and  is  unambiguously 
oithor  valid  or  not  valid,  Koithor  of  thoso  oonditions 
holds  in  the  gonoral  caso  of  non-binary  brackets,  Tho 
first  cloarly  cannot  bo  truo,  and  tho  second  is  three  valued. 
To  go  into  tho  lattor  rroro  dooply,  it  is  true  that  by  the 
algorithm  wo  can  say  whothor  any  givon  string  of  substitu¬ 
ents  Sg,  S^,  , ,*.*Sn  is  or  is  not  a  valid  bracket, 

takon  by  itsolf.  For  tho  purposes  of  obtaining  a  complete 
brackotting  of  a  sontonoo,  howovor,  wo  nood  to  know  more 
than  this,  Wo  nood  to  know,  givon  again  a  string  of 
substituents  8^,  Sg,  ,,,  SQ,  first  whothor  (8^  Sg)  is 
valid  or  not,  and  sacondly  whothor  the  longer  string 
(Si»  Sg,  ,,,  &x)  might  bo  valid  so  far  as  tho  information 
concoming  and  Sg  is  concornod.  As  has  boon  soon  in 

tho  soction  on  tho  algorithm,  tho  algorithm  givos  us  throo 
possiblo  cases,  (l)  Tho  substituents  cannot  occur  together 
in  a  valid  brackot  group,  (ii)  Tho  substituonts  do  not 
form  a  valid  brackot  group  as  thoy  stand,  but  might  form 
ono  when  furthor  substituonts  aro  addod  to  tho  group  (o.g, 
a  governor).  And  (iii)  Tho  substituonts  form  a  valid 
brackot  group,  in  which  oaso  they  might  also  still  form  oao 
whon  further  substituents  aro  addod  to  the  group. 

In  tha  plaoo  of  tho  ono  list  of  binary  brackets,  wo 
roquire  two  lists,  ono  for  brackets  whioh  at t  known  to  be 
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valid,  and  ono  for  brackets  which  aro  potentially  valid* 

On  tho  other  hand,  thoso  lists  will  not  grow  to  tho  extent 
of  tho  binary  list,  as  tho  ’a  dark  blue  hat*  tremble 
doscribod  above  cannot  occur,  Tho  possible  bracket  groups 
in  that  case  will  be: 


a  dark  (blue  hat) 
a  (dark  bluo  hat) 

(a  dark  bluo  hat) 

Tho  possibility  of,  for  oxamplo,  (dark(bluo  hat))  is 
specifically  oxcludod  by  the  dictionary  ontry  for  a  substi¬ 
tuent  typo.  An  ondocontric  group  may  not  bo  a  mombor  of  a 
group  of  tho  same  typo.  Thus  (bluo  hat),  as  a  nominal 
group,  will  not  bracket  with  ’dark'  in  another  nominal 
group,  and  (dark(bluo  hat))  cannot  bo  exhibited  as  a  nomi¬ 
nal  group,  although  (dark  bluo  hat)  can  bo  so  exhibited. 

Wo  would  also  bo  ablo  to  doal  straightforwardly  with  tornary 
groups  such  as  conjunct  groups, 

optimal  soarch  procoduro  using  such  lists  has  not 
yot  boon  fully  worked  out,  and  tho  practical  problems  in- 
volvod  aro  considerable.  There  is  no  theorotioal  reason 
why  such  a  procoduro  should  not  bo  found,  for  this  is 
essentially  an  offectivo,  finito  problom. 
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Tho  programme  will  bo  ablo  to  deal  with  any  recur¬ 
sively  defined  algorithm  givirg  rules  of  combination  of 
substituents  into  brackot  groups.  Provision  is  being  made 
for  throe  difforont  algorithms,  oaoh  operating  on  the 
information  contained  in  one  40  bit  computer  word,  Thoso 
will  bo  usod  for  tho  Governor-Dependent  information,  tho 
Word-Ordor  information,  and  tho  Concord  information  as 
doscribod  before,  but  other  algorithms  operating  on  other 
information  could  bo  substituted  without  muoh  difficulty. 


PHOT ION  3 


The  Algebra  of  Punched  Cards 
INFORMATION  i  QUESTION  AND  ANSWER  : 

As  R.Q,  Collingwocd  (3)  emphasized  scientific  know¬ 
ledge  may  only  be  arrived  at  by  asking  questions.  Wo  cannot 
of  course  prove  this,  but  it  provides  a  definition  of 
'information*  or  ' content; '  ;  at  least  it  provides  us  with 
a  definition  that  enables  us  to  have  computable  information, 

(l)  If  a  question  is  answered  (say  by  a  dictionary-maker) , 
i,e.  if  a  numerical  value  (1  or  0  or  Prob  e/b)  has 
been  assigned  to  it,  then  the  answer  is  w  ell-formed,  and  the 
function  to  which  the  value  has  been  assigned  is  well- 
defined  and  thus  Turing-computable, 

(?)  If  the  answers  to  the  questions  are  definite, 
e,g,  the  result  of  a  football  match  -  Win,  Lose,  Draw, 

i.e,  if  one  of  a  finite  set  of  valuss  are  assigned  (e.g, 

'Yes'  or  'No'),  then  the  answers  may  be  treated  as  well- 
forraed  formulae  in  an  n-valued  logic,  where  n  is  finite; 
and  in  this  case  it  may  be  reduced  successively  to  the  two¬ 
valued  case.  This  is  not  in  general  true  when  the  third 
value  is  "doubtful".  Thus  (3)  Where  the  answers  to  the 
question  are  'Yen/No',  they  constitute  a  two-valued 


47. 

propositional  oalouluss  each  anewer,  conventionally 
written  *1*  or  ’O’  will  he  a  proposition,  or  a  bit  of 
information* 

Y/e  shall  here  consider  the  third  case  in  detail. 

First,  because  punched  card  equipment  fundament ally  produces 
only  two  states,  for  either  it  punches  a  hole  or  it  does 
(  not;  ite,  it  performs  only  two  operations,  it  punches  a 

'hole*  or  it  punches  a  'not-hole',  Second,  because  the 
properties  of  the  circuits  used  in  punched  card  equipment 
have  been  known,  since  Shannon  (14)  to  be  analogous  to 
the  Boolean  algebra  #f  propositions.  Third,  because  the 
use  of  three-valued  or  n-valued  logic  as  proposed  by 
Lukasiewicz,  (ti),  Reichenbach  (13)  or  von  Wei^hcker 
(21),  is  not  generally  agreed  to  be  valuable.  Fourth, 
because  in  our  syntactical  investigations,  we  have,  up  to 
now,  compelled  1  or  0  answers  by  framing  the  questions 
about  words  and  substituents  in  terms  of  the  possibility  of 
the  occurrence  of  a  property  (thus,  e0g.  wa  have  not  asked 
'Does  this  word  occur  as  a  governor  in  a  nominal  group?) 
but  'Can  it  occur.,,.*?). 

The  Logical  Algebra 

The  logic  which  we  shall  use  to  interpret  punched 
card  computation  is  elemantary:  l.e.  it  is  the  Boolean 


1*8 


algebra  of  propositions,  or  classes  or  properties  (for 
these  may  be  mapped  iscmcrphioally,  •-t.o  each  other)  0 
Originally  formulated  by  George  Boole,  (l)  and  (2),  it  was 
adapted  by  C„S,  Peirce  (ll)  and  E.  Schroder  (16)  and 
expounded  systematically  by  C.  I.  Lewis  (7)»  It  has  always 
been  known  that  a  complete  decision  procedure  for  such  a  logic 
exists:  we  shall  not  be  considering  tha  Calculus  of 
Predicates  or  any  calculus  involving  unlimited  quantificat¬ 
ion,  since  it  has  been  known  since  Turing  ( 18)  and  Church 
(1*)  that  there  exists  no  decision  procedure  for  suoh  a 
calculi,  and  therefore  no  complete  computable  punched  card 
representation  could  exist,  though  Boole  himBelf  did  give  a 
representation  of  a  calculus  of  predicates  in  terms  of  his 
algebra. 

Though  it  is  not  essential  to  the  exposition  we  shall 
choose  che  propositional  interpretation,  and  follow  the 
computable  rules  suggested  by  B,  Hilbert  and  B.  Bernays 
(5)  and  D,  Hilbert  and  W.  Ackermann  (6),  We  shall  also 
use  the  closely  connected  Truth- table  notation  of  E„ L,  Post 
(12)  and  L,  Wittgenstein  (20)„  We  use  the  following 
notation 

* p, » o  q»  •  •  r, » *  represent  propositional  variables  whose 
values  are  the  names  of  propositions. 
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•pi^q*  read  'p  cap  qf  will  represent  ’and’  i.e* 
logical  conjunction  or  intersection.  It  is  defined  by  the 
truth table 

£>  P  Q 

111 
0  0  1 
0  10 
0  0® 

'pUq'  read  'p  cup  q*  will  represent  'or'  i.e, 
logical  disjunction  or  union.  It  is  defined  by  the  truth- 
table 


U  p  q 


111 
10  1 
110 
0  0  0 


p  read  'not-p'  will  represent  classical  comple¬ 
mentation  or  negation.  It  is  defined  by  the  truthtable 


-  P 

0  1 
1  0 

*  ft)  3  I*  read  '  ft  is  a  tautology'  will  represent 
the  assertion  that  the  formula  is  true  no  natter  what  the 
values  of  the  constituent  variables 
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\ 


It  is  defined  by  the  truthtable 


0  =  1  \  P  Q 


1  11 

1  0  1 

1  10 

1  '00 


'  =  0  '  read  *  jh  io  a  contradiction'  will 

represent  the  assertion  that  the  formula  is  false  no  matter 
what  the  values  of  the  constituent  variables. 


It  is  defined  by  the  truthtable 
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0 

0 

0 

0 


1  1 
0  1 
1  0 
0  0 


It  has  been  known  for  a  long  time  that  these  functions 

are  sufficient  to  define  any  other  truth-function  of 

of  the 

elements  (of  the  values  /  elementary  propositions  given 
by  the  bits),  of  which  in  the  case  of  two  valued  logic 
there  are  2  ,  where  n  is  the  number  of  elements. 

E.g,  'p — 9q'  may  be  defined  as  p  U  q  ('material 
implication'),,  .  and  'Equivalence'  as  p£r*  q  , 
thus  permitting  algebraic  computation,  because  equations 
can  be  written  in  the  notation,  and  Boole's  original  addi¬ 
tion  operation^  ■  may  be  defined  as  pnq  u  pn  q  • 
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They  are  also  sufficient  to  set  up  this  algebra  axiomati- 
cally,  whioh  is  uninteresting  for  computation, 

9e  rhall  use  the  following  properties  of  these  funct¬ 
ions 

(1)  U  and  H  are  idempotent,  commutative,  asso¬ 
ciative,  and  distributive  over  each  other,  i.e,  the 
following  equivalences  hold: 

(d)pu(qnr)  =  (puq)n(pur)  (a) 

and  pn(<JJr)  =  (pnq)u(qnr).  (*>) 

(c) 

(2)  —  is  classical  i.e* 
hold: 

(a) 

(b)  p  n,  q=  p  ug 

pnq  (e)  0  =.  I 

(3)  The  negation  of  any  function  jb  (p...q)  may  be 
constructed  by  replacing  each  constituent  bit  by -.its  nega¬ 
tion,  each  occurrence  of  U  by  H  ,  each  occurrence  of 

O  byu  ,  and  each  occurrence  of  I  by  0,  and 
0  by  I, 


p  3  (pn  pMp  up) 

(pnq)  =  (q  np)  ,  and 

(puq)=»(q  Up) 

pn  (qnr)  =»  (pnq)  Or  and 

P  U  (qur)  »  (puq)  Ur  , 

the  following  equivalences 

P3P  (d)  1*0 
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( 


{ 


(4) 

•k 

■Q. 

n 

M 

c 

"Q 

n 

0=0 

j6  u  i  =  i  , 

jb'JO 

=  Jb 

(5) 

Where  an  equivalence 

/b  =<  JIf  is  shown  to  be  a 

tautology,  another  equivalence  which  is  its  dual,  may  be 
obtained  (if  fi  and  /  consist  only  of  elements  and  their 
negations  connected  by  n  andu  ),  simply  by  interchang¬ 
ing  Hand  U  throughout  the  formula.  Thus  if 


|p..q)  =  ^  (p. „qj] 

fi'  [(p..q)  =  X'  (Po.q)] 
ed  such  that 

(p...q)=  ?'  (p.*q) 

(p..q) j  P  (p» « q) 

only  in  that  U  is  replaced  byH 


I  then 

I  may  be  construot- 

differs  from 

and  H  le  replaced  by  u  * 


(6)  Prom  (3)  and  (5)  it  follows  that  for  any  formula 

/6  »  I  there  will  be  another  formula  =0  constructed 

by  replacing  eaoh  bit  by  its  negation,  and  ft  by  u  . \  and  U 
by  H  •  Thus  one  may  eitner  compute  whether  a  formula 
is  =*I,  or  by  using  this  rule,  arrive  at  an  equation, 

J0*  »  0  to  which  there  is  always  a  solution. 

(7)  For  any  formula  j6  (p.  *.q)  there  exists  a  repre¬ 
sentation  in  the  so-called  Normal  Forms*  The  Conjunctive 
Normal  Form  la  a  Conjunction  of  brackets,  where  eaoh 
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bracket  contains  a  Disjunction  in  which  each  element  is 
an  elementary  proposition  or  a  negated  elementary  propo¬ 
sition,  The  Disjunctive  Normal  Form  is  a  Disjunction  of 
brackets  where  each  bracket  contains  a  conjunction  in  which 
each  element  is  an  elementary  proposition  or  a  negated 
elementary  proposition. 

(8)  Using  p=>pH  (qUq),  the  Expansion  or  Development  of 

u 

p  with  respect  to  q  is  (pDq)  4>  Dq)»  Hence  any  function 
of  elements  (p,,0q)  may  eleo  be  treated  as  a  function 

of  any  other  element  n,  though  originally  there  is  no  in¬ 
formation  concerning  n, 

(9)  The  negative  of  a  formula  may  be  constructed  by  form¬ 
ing  the  Expansion  of  I  with  respect  to  all  the  elements 
that  appoar  in  the  expression  For  I  =*  pup 

Thus  I  -  (p  U  p)  n  (q  U  q) 

=  (pjp)n(quq)n(xur)  . . . . .  (n  M  n) 

Then  the  negative  or  any  fully  eapanded  expression  is  the 
remainder  of  the  expansion  of  one.  This  may  be  written, 
following  Boole,  (l)  (1  -  J0).  This  enables  us  to  intro¬ 

duce  —  subtraction  (the  inverse  of  ♦  or  Boole' o 
addition  operation).  We  have  thus  got  an  arithmetic  defi¬ 
nable  in  the  logic,  and  oan  compute  any  natural  number  in  it* 
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(lO)  There  is  a  representation  for  any  formula  in  the  so- 
called  'Ausgczoichnete  Normalf orm’ ,  the  Exceptional 
Normal  Form,  This  is  constructed  "by  developing  the  formula 
(pH  p)  u  (qOq)  ,  (nfiE),  in  accordance  with  the  dis¬ 
tributive  lawf  Where  n  consists  of  all  the  elements  re¬ 
quired,  This  will  produoe  an  expression  in  the  Conjunctive 
Normal  Form,  and  each  bracketted  part  of  this  expression 

will  be  equivalent  to  a  distinct  truth-function, and  one  set 
brackets 

of  these/will  be  equivalent  to  the  required  formula  (except 

the  degenerate  case  of  the  tautology,  which  is  represented 

by  the  zero  set  of  such  brackets).  In  order  to  produce  the 

Exceptional  Normal  Form  of  an  expression  it  must  first  be 

reduced  to  the  Conjunctive  Normal  Form,  Assuming  that  the 

elements  in  question  are  p.,..»n),  wo  drop  all  brackets 

of  the  form  (p  u  p)  and  write  p  for  p'J  p  throughout. 

Then  each  remaining  bracket  will  be  a  disjunction  of 

distinct  elements  of  the  sequence  p, , ,n.  If,  in  any 

bracket,  both  q  and  q,  say,  are  lacking,  we  may  extend 

tho  disjunction  by  the  element  (q  n  q)  (for  this  =*0)  and 

apply  the  distributive  law  until  each  bracket  contains, 

either 

for  each  element  p,/  that  element  or  its  negation, 
(Since  O  is  commutative,  distinct  orderings  are  equiva¬ 
lent  and  only  one  need  be  written. 


Because  this  representation  corresponds  to  the  truth- 
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table  notation,  two  formulae  which  have  the  same  represen¬ 
tation  in  this  Normal  Form  are  truth-equivalent:  this 
enables  us  to  give  a  unique  representation  of  any  formula* 

V?e  may  arrive  at  the  simplest  form  of  a  formula  by  giving  it 
this  representation  and  applying  the  Elimination  Rule  stated 
in  Hilbert  and  Ackermann  (6) : 

(itf  u?  )  n(,Ju^  )a  (  0  n  u  y  3  jg 

(ll)  In  order  to  enable  us  to  write  down  in  a  mechanical 
fashion,  the  algebraic  formula  corresponding  to  a  truthtable, 
we  assign  truth-values  in  the  following  manner,  which  is  both 
fixed  and  exhaustive:  1010  for  the  first  element,  1100 

for  the  second  and  so  on. 

Since  punched  card  equipment  is  designed  to  record 
information,  it  is  obvious  that  there  will  be  a  representa¬ 
tion  of  this  algebra  in  the  cards  and  appropriately  designed 
equipment.  The  equipment  we  consider  here  is  the  Keypunch. 
The  Duplicating  punch,  the  Reproducer  Punch,  the  Sorter, 
and  the  Collator,  and  soma  actual  and  possible  modifications 
of  these, 

T&£_C££&: 

The  card  to  be  considered  is  the  I.C.T,  407-353. » 
containing  12  rows  and  80  columns,  i,o.  there  are  960 
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meohanically  distinct  spazes.  On  each  of  these  a  hole  nay 
or  may  not  be  punched  (here  the  hole  is  exclusive  to  the 
"not-holo ,Mas  ’Yos”  is  exclusivo  to  wNoM  ;  thus  convention¬ 
ally  a  hole  is  taken  to  represent  ttlw  ar.d  a  blank  space  is 
taken  to  represent  W0M.)  It  is  worth  noting  that  the  oppo¬ 
site  interpretation  is  possible  and  some  interesting  results 
follow  from  the  possibility  of  reversing  the  interpretation; 
we  shall  return  to  this. 

The  card  is  a  two-dimensional  space  which  iB  divided 
conventionally  into  12  rows  and  80  columns  on  to  which  in¬ 
formation  is  punched.  However,  information  appearing  on  any 
card  can  be  transferred  mechanically  from  any  sot  of  columns 
to  any  other  sot.  Thus  the  order  of  tho  information  in  the 
columns,  which  may  easily  be  taken  to  be  significant,  may 
be  changed  or  shuffled,  by  the  use  of  tho  Reproducer  punch. 
Since  any  substitution  of  columns  is  possible,  the  card  may 
be  conceived  of  as  bent  round  a  cylinder,  and  rotation  of 
colufnn  and  column  information  is  possible. 

It  is  not,  in  general,  possible  to  transfer  informa¬ 
tion  from  row  to  row  in  the  same  way,  but  to  some  extent 
this  may  be  overcome  by  the  rotations  of  the  card,  which  we 
consider  here  for  tho  first  time. 


The  single  card,  which  is  an  essential  element  in  our 
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considerations,  is  capable  of  two  independent  rotations, 
i.e,  any  other  rotation  of  the  card  is  a  combination  of 
those  two  (but  not  of  course  the  mapping  of  column  information 
on  to  othor  columns  reforred  to  above).  These  rotations  are 
(l)  Abibut  the  small  diameter  of  the  card,  which  we  call  the 
Right-Left  rotation,  and  (2)  About  tho  large  diameter  of 
the  card,  which  wo  call  tho  Up-Down  rotation.  If  we  are 
propared  to  sacrifice  some  of  the  information-capacity  of 
the  card  by  using  only  the  480  bits  on  the  Left-hand  side  of 
the  card  (columns  1-40 ) ,  then  the  Right-loft  rotation  makos 
possible  comparison  of  information  on  the  Right-hand  side  of 
tho  card  with  information  on  tho  Left-hand  half.  Similarly, 
if  wo  aro  prepared  to  sacrifice  information  carrying 
capacity  on  the  Down-side  half  of  tho  card,  the  Up-Down  ro¬ 
tation  makes  possible  comparison  of  information  on  the  Up 
480  bit  half  with  information  on  tho  Down  480  bit  half.  And 
if  wo  aro  prepared  to  sacrifice  information  capacitj&^till 
further,  wo  may  divide  the  card  into  quadrants  of  240  bits 
oach,  Tho  two  rotations  then  make  possible  comparison  of 
information  on  any  part  of  tho  card  with  information  on  any 
othor  part  of  $he  card.  These  operations  are  conveniently 
performed  on  tho  Duplicating  punch,  modified  only  to  permit 
two  cards  to  bo  inserted  simultaneously  into  the  reader,  but 
which  operates  fundamentally  on  single  cards  or  a  pair,  not 
a  pack. 
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Another  extension  of  the  range  of  computation  is 
achieved  by  considering  the  three-dimensional  array  of  a 
sot  of  cards  placed  one  on  the  other,  or  a  pack.  This  may 
be  arranged  in  any  order.  The  Reproducer  Punch,  the  Sorter, 
and  the  Collator  are  constructed  to  compute  over  a  pack, 

Tho  Key- Punch; 

Tho  keypunch  has  only  ono  basic  operation  -  and  thiB 
Is  true  of  all  punched  card  equipment  -  it  punches  a  hole 
in  a  given  space.  This  operation  is  asymmetiical,  i.e, 
a  blank  space  or  a  ’not-hole’  is  immediately  converted 
into  a  'hole’,  (thus  a  'O'  is  mechanically  converted  into 
a  *1',  but  without  further  change  in  the  equipment,  a  ’hole* 
cannot  bo  converted  into  a  * not-holo',  thus  a  *1'  cannot 
be  convertod  into  an  ’O’.  )  It  is  immediately  obvious  that 
this  .statomont  may  be  changed  symmetrically,  by  taking  tho 
’hole’  to  mean  '<>  f  and  the  ’not-holo'  to  moan  '1*  but 
the  asymmetry  still  remains  and  a  reversal  of  interpretation 
may  not  bo  done  simultaneously.  Prom  this  it  imim  diately 
follows  that  thoro  does  not  oxist,  in  the  punched  card 
algebra,  a  gonoral  mechanical  definition  of  negation  or 
complementation  as  defined  in  our  truth-table.  There  are 
important  consequonces  to  this  and  we  shall  return  to  the 
whole  question  of  computing  tho  negation  of  a  formula. 


The  Do  tailed  Roprosentatlon  of  this  Algebra  on  Punched  Cards 

A  hole  represents  the  value  * 1 *  given  to  a  variable, 
say  * p * ,  A  blank  space  or  '  not-hole*  represents  the  value 
*1'  assigned  to  a  variable,  say  1 p *  ;  it  thus  represents 
the  value  'O’  assigns  to  '  p’* 

Tho  card  itself  ropresents  a  function  /)  (p^,  Pg»  •  •• 

Pggo)  the  valuos  of  tho  elements  p^,  •••2960*  or  t*1Q 

bits  of  information.  Thus  if,  for  the  sake  of  simplicity, 
wo  only  consider  the  caB6  of  two  elements  'p*  and  * q *  , 
tho  number  of  cards  necessary  to  represent  all  distinct 
truth-functions  of  these  elements  will  bo  16.  Of  these  one 
card  will  be  blank,  and  this  will  represent  =0  and  one 
will  havo  four  holes  *■  (=*l),  ^he  complete  punched  card 
universe  of  information  is  a  card  with  960  holes  on  it 

(this  may  be  multiplied  by  taking  packs  of  cards,  with  a 

certain  inconvenience  in  considering  tho  rules  for  reading 
tho  cards):  but  the  universe  with  respect  to  the  n  ele¬ 
ments  of  information  under  consideration  is  less:  it  is  2n, 

Since  any  problem  in  this  logic  may  be  formulated  as 
asking  whether  a  funotion  0  =*  1  or  =»0,  or  whether  some 
function  of  functions  of  elements  Xand  the  value s  of  those 
functions  are  themselves  =1  or  **0  ffs  »1  or  =o),  any 


problem  which  consists  of  computing  the  value  of  a  function 
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of  functions,  may  bo  represented  by  a  pack  of  cards  and  the 
solution  will  be  an  output  card,  which  is  =1  or  =0. 

Because  all  cards  may  bo  given  a  complement-card,  by  con¬ 
structing  one  in  accordance  with  the  punching  rule  described 
below,  and  the  computation  of  the  equivalence  of  two  funct¬ 
ions  is  always  roduoiblo  to  an  equation  C. , . . . ]  =  0, 

there  are  four  remaining  problems  that  arise  :  (1)  to  comr* 
pute  fb  ,  or  the  union  of  the  information  of  a  pack 

of  cardsj  (2)  to  computetho  conjunction  or  the  intersection 
of  the  information  on  a  pack  of  cards  or  the  formula  jE!  n  JJ  ,  j 
(3)  to  compute  (a)  whethor  a  particular  card,  representing 
at  least  an  intersection  of  properties,  F,G.H,  exists  or 
does  not  exist  inta  pack*  and  (b)  whether  a  card  identical 
to  a  particular  card,  i.e,  representing  only  F,G,H,  exists 
or  does  not  oxist  in  a  pack;  ( k )  to  compute  the  complement 
or  the  negation  of  a  given  function  or  card,  i.e,  to 
c  oraputo  j6  , 

There  are  genoral  mechanic  al  solutions  to  the  first 
three  problems,  and  though  our  punched  card  logic  contains 
no  operation  of  negation  (such  as  would  be  provided,  say, 
by  a  conditional  switoh  which  would  reproduce  from  a  card 
fb  its  complement,  fb  by  punohing  a  hole  where  there  was  a 
'not-hole*  and  a  'not-hole'  where  there  was  a  ’hole1)  we 
havo  a  general  solution  to  the  fourth  problem.  Although 
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wo  cannot  in  general  perform  algebraio  computation  in¬ 
volving  negation,  we  can  always  compute  the  actual  values 
of  any  well-formed  formula,  even  when  it  involves  negation 
essentially » if  wo  are  given  the  values  roquired  to  do  the 
positive  computation, 

(1)  Tho  general  solution  to  (l)  is  available  as  soon 
as  it  is  seen  that  the  operation  of  duplicating  n  cards 
(for  the  sako  of  simplicity  wo  consider  only  two),  when 
performed  successively  on  to  the  same  output  card,  will 
produce  tho  disjunction  or  the  union  of  all  the  bits  or  the 
values  of  the  elementary  propositions  on  both  cards.  Thus 
wo  will  havo  ^  with  all  the  holes  there  are  on  as 
well  as  all  tho  holes  thero  are  on  V  .  This  provides  a 
general  mechanical  definition  of  j6  U  and  by  the  Law  of 
Duality  of 

jj  n  jff  . 

(?.)  The  general  solution  to  (2)  is  similarly  provided 
by  tho  basic  punching  operation.  If  two  cards  are  placed 
on  each  othor  ft  tho  roador  (as  in  a  Duplicating  Punch  modi¬ 
fied  to  read  two  cards  simultaneously),  the  holes  on  the 
output  oard  f  will  be  those  that  are  both  on  and 
on  V'  ,  In  othor  words  we  can  provide  a  general 
mechanical  definition  of  /6  n  Jtf  •  And  by  the  Law  of 
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Duality  of  (  0  U  Y  )  «  The  Dual  forma  are  important 
because  ono  may  either  reverse  the  interpretation  of  1  and 
0  or  punch  the  complement  of  the  information  on  the  card. 

(3)  The  general  solution  to  this  problem  is  provided 
by  the  solution  to  the  other  logical  problems.  We  will 
reserve  detailed  consideration  of  this  problem  until  we 
describe  some  of  the  possible  operations  of  the  Sorter,  the 
Collator  and  the  Reproducer:  generally  speaking  this  is 
the  problem  of  solacting  a  function  i.e.  computing 
whether  in  a  pack  there  is  a  oard  whoso  function  has  a  cer¬ 
tain  relation  to  a  pattern-card  or  function, 

(U)  The  general  solution  to  tho  problem  of  negation 
must  bo  sought  in  the  logic  itself.  It  is  clear  that  the 
existence  of  U  and (1  and  any  function  of  these,  suffices 
to  construct  a  non- complemented  distributive  lattice  or  a 
'positive  logic*  in  tho  sense  of  Hilbert  and  Bornays  (5)  * 
it  is  therefore  impossible  to  construct  all  the  formulae 
which  aro  permitted  in  a  complete  logical  algebra.  And  tooth 
of  tho  rules  wo  have  given,  o.g.  (3)  and  (9)  of  tho 
properties  of  tho  algebra,  which  aro  inddpondent  of  eaob 
other,  use  the  operation  of  negation,  and  it  would  appear 
that  any  computation  of  the  value  of  a  formula  whoso  repre¬ 
sentation  in  the  Exceptional  Normal  Form  essentially  involves 
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a  negation  sign  oannot  be  done  without  the  addition  of  a 
cirouit  or  conditional  switch,  that  will  reproduce  a  card 
undor  the  symmetrical  condition  that  a  holo  is  replaoed  by 
a  not-holo  and  a  not-hole  by  a  hole.  Certainly  it  would  bo 
vary  conveniont  to  havo  such  a  conditional  transformation, 
but  it  is  not  nocossary,  sinco  wo  may  instead  double  the 
numbor  of  olomonts  in  our  univorso  by  assigning  to  each 

(  *  1  *  aft  ’O’  and  to  each  'O’  a-  *  1 *  ,  Wo  widen  our 

algobra  by  troating  all  negations  as  if  they  were  positive 
olomonts:  to  each  of  tho  original  elements  there  corres¬ 
ponds  a  now  ’positive1  olement  whoso  relation  to  the  origi¬ 
nal  one  is  exactly  that  provided  by  a  negation  operation. 

In  connection  with  the  interpretation  in  terms  of  quostion 
and  answor  it  is  worth  noting  that  ’Yes,  p  is  not  true' 
is  truth-o quivalont  to,  (is  the  samo  as),  ’Yes,  p  is 
true’,  or  ’p’  i,  e,  ’No’, 

(  Wo  may  construct  the  complement  of  a  given  card  -  or 

any  olement  of  which  a  card  is  a  function  -,  by  the  following 
punching  rule:  wherever  there  is  a  ’1'  on  tho  card,  punch 
an  ’O',  which  we  may  call  its  ’positive  negations  on  the 
other  card,  i»o,  do  not  punch  in  that  space:  wherever 
thore  is  an  ’O'  on  the  card,  then  punoh  a  '1',  i.e,  a 
hole  on  tho  other  oard.  It  is  clear  that  the  intersection  of 
those  two  oards  representing  functions  0  and  0 
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is  »0,  therefore  jtf  is  =>  ]6  ,  ConsiderablQ  subtlety  may 
bo  introduced  here  by  considering  the  Up-Down  partition  of  the 
card  into  two  halves  of  480  bits  oach  as  constructing  two 
cards  of  which  the  Up  and  Down  may  be  seen  to  be  comple¬ 
ments  if  the  same  punching  rule  is  followod.  A  card  f6 
will  then  contain  two  sections:  Rows  Y,  X,  and  1-4 
will  contain  tho  information  and  Rows  5-12  will  contain  .J. 
the  complement  of  this  information.  Of  course  such  ooding 
will  not  permit  tho  original  card  also  to  bo  treated  as 
ono  function  of  elements  i, e,  j6  (Up-Down)  is  not  a 
function  in  our  Boolean  Algebra  whore  tho  Up  section  re¬ 
presents  say  and  the  Down  section  represents  its  nega¬ 
tion  -  jt)  ,  For  any  operation  of  union  or  intersection  on 
the  total  card  will  produco  a  card  j tf  (Up-Down)  in  which 
the  Up  and  Down  sections  ar3  not  complements. 

We  now  note  thfll  fact  that  any  function  $  (p, 

of  n  elements,  has  a  Normal  Form  representation  whore  the 
bracketting  in  the  formula  is  of  only  ono  level.  Wo  may 
thus  have 

(pnqHr)  U  (pOqOs)  u  (pPqnr),,,.  and  not  (pHqnP)  u 

C(pnq)n§3  • 

The  negation  sign  occurs  only  over  the  elements.  Lot 
(p»..n)  be  such  a  funotion  in  the  Normal  Form  with  n 
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elements  In  a  complete  logio  that  includes  negation.  We 
define  m  now  oloments  (whore  m  $  n)  called  p*  ,  q*  ,  r* 
etc.,  by  the  rule  p'  =p,  q*  =*q,  r*  =r>...  m*  =m»  For 
every  occurrence  of  p,  q,  n,  in  0  substitute  p’,  q’, 

,  ...m’ •  We  now  have  a  new  function  0  (p^  p^,  q^  q|  r^  r, ) 

with  (n+m)  elements  which  is  also  in  the  Normal  Form,  but 
is  expressed  in  a  positive  logio.  Clearly  0  and  0  are 
truth-equivalent  and  as  0  is  expressed  in  a  positive 
logic,  0  and  hence  0  are  computable  on  positive  punohod 
card  machinery.  Thus,,  given  any  function  0  (n)  in  a 

comploto  logic,  there  is  a  fixed  rule  for  converting  it  into 
an  o quivalent  function  of  not  more  than  2n  elements,  in 
a  positive  logio. 

In  practice,  this  procedure  does  not  cause  any  incon¬ 
venience,  provided  that  the  necessity  for  it  i6  realised 
from  the  start,  when  the  data  to  be  processed  is  first 
punched  on  to  cards.  The  standard  punching  convention  is 
to  punch  a  hole  for  a  '1*  (for  a  true  proposition)  ana  to 
do  nothing,  (i.e.  push  the  space  bar)  for  an  'O’,  If  the 
computation  to  be  performed  on  the  data  will  require  nega¬ 
tion,  it  is  essential  to  effect  the  substitution  of  p'*p 
at  the  input  stage,  by  punching  a  hole  in  one  position  for 
a  *1*  and  a  hole  in  a  different  position  for  an  'Of,  We 
might,  e.g,  punch  a  hole  in  row  1  for  a  ’l’  and  a  bole 
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in  row  6  for  an  *0',  thus  allowing  ua  to  use  tho  row 
transposition  from  Y-3  to  9-4,  effected  by  performing 
the  Up-Down  rotation  of  the  card,  to  take  union  or  inter¬ 
section  with  the  element  p,  or  its  complement  p  =»p'.  It 
is  true  that  half  the  capacity  of  the  card  is  lost,  but 
this  punching  rule  does  not  in  fact  increase  the  number  of 
key  depressions  when  punching  tho  data. 

( 

We 'wish  to  emphasize  once  again  that  our  solution  of 
the  problem  does  got  si'ggest  that  a  positive  logio  is  com¬ 
plete  In  tho  sense  in  which  a  logic  containing  negation  is 
complete;  (here  "a  logic  containing  negation"  is  meant  to 
include  a  logic  containing  operations  such  as  Shaffer' s  (15) 
stroke-function3,  f  or  Boole's  exclusive  or  , -4- ,  in 
terms  of  which  a  negation  function  is  easily  definable). 

All  wo  assert  is  that  becauso  there  is  a  Normal  Form  repre¬ 
sentation  for  any  formula  such  tnat  negation  signs  appear 

(  only  over  the  elements,  the  value  of  a  formula  containing 

negations  essentially  may  nevertheless  bo  computed  by 
doubling  the  number  of  elements  in  the  algebra*  the  neces¬ 
sity  for  computing  tho  negation  of  a  formula  is  removed,  by 
having  the  complements  of  the  elements  and  computing  posi¬ 
tive  functions  oif  these  elements  and  their  complements. 
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Tho  Duplicating  Punch  s  Modified  to  aooopt  a  pair  of 
cards  in  the  reader,  is  a  complete  positive  logic.  We  use 
tho  Right-Left  card  rotation,  Wq  c an  compute 

(1)  fb  C\lt  in  one  machine  operation 

(2)  u  )A  in  one  machine  operation 

and  (3)  UM)  U(j0ktf)in  two  machine  operations. 


The  Sorter; 

This  machine,  which  will  reverse  tho  order  of  a  pack 
of  cards,  will  solve  tho  problem  of  selecting  a  card  which 
contains  at  least  a  certain  intersection  of  properties, 
from  a  pack.  Given  enough  time,  it  may  even  select  a  card 
identical  to  a  givon  function  or  card.  It  is  the  most 
primitive  machine  for  extracting  information. 

The  Collator;  Ignoring  tho  arithmetical  properties  of 
this  machine,  this  machine  can  bo  treated  as  a  more  effi¬ 
cient  sorter  or  selector.  It  will  select  from  a  pack,  all 
those  cards  identical  to  a  givon  master-card  where  the 
card  consists  of  not  more  than  32  columns,  each  of  which 
contains  not  more  than  one  bit  per  column, 

Tho  Reproducer  Punchi 

This  machine  is  fundamentally  intended  to  duplicate 
either  a  large  number  of  cards  off  a  single  card  or  to 
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reproduce  packs  of  cards.  Wo  shall  show  that  it  oan  be 
regardod  as  a  complete  logic,  and  thus  also  as  a  oomploto 
information  soloction  machine,  far  more  officiant  than  tho 
sorter,  (which  doponds  on  asking  Yea/'No  questions,  col¬ 
umn  by  column)* 

In  torms  of  the  punched  card  logic,  the  operation  of 
duplication  can  compute  *he  disjunction  or  union  of  the 
information  in  a  pack  of  cards,  by  transferring  the  infor¬ 
mation  on  tho  first  card  on  to  tho  second,  which  then  repre¬ 
sents,  say,  0  U  ,  This  output  card  is  then  transferred 
on  to  the  third  card  and  so  on  to  the  end  of  the  pack.  In 
other  words  whore  n  is  tho  number  of  cards  this  machine 
will  compute  U..n  ;  by  tho  Law  of  Duality  this 

is  simultaneously  a  computation  of  tho  formula  0D  0n /f)  n. 

If  we  either  havo  our  information  punched  to  include  the 
complement  cards  of  all  cards,  or  preferably  wo  add  a  ne¬ 
gation  or  a  conditional  switch,  wo  havo  a  machine  which 
will  do  all  tho  necessary  computation  of  our  logical  algebra. 

The  machine  as  it  stands  will  also  compute  the  unions 
or  disjunctions  of  pairs  of  cards  taken  in  order  from  two 
distinct  packs;  if  the  two  packs  in  question  represent  the 
functions  fi  (  . ,n  )  and  Jj  (.,,m)  it  will  compute  the 
sequence  of  unions  0^  u  J6\j  JSZ,  0n,  and  the 


seguonoe  of  intersections  O  »/5n^jtfn  , 

These  two  distinct  machine  properties  depend  on  the 
fact  that  oolutnn  information  may  ho  transferred  to  other 
columns  or  to  the  same  columns. 

In  order  to  regard  this  machine  as  a  selector,  and 
thus  as  a  more  efficient  sorter,  wo  will  take  problem  (3) 
and  give  a  solution,  (a)  If  wo  wish  to  extract  from  a 
pack  of  cards,  all  those  which  include,  i.o,  contain  at 
least  a  particular  intersection  of  properties,  say  P,  G, 
and  H,  this  is  to  oxtract  from  the  pack  all  those  cards 
for  which  Pf!  GO  H  -»1,  i.e,  for  which  P^O^H  =G, 

The  most  economical  solution  is  not  to  compute  this 
union  from  card  to  card  but  from  column  to  column  within 
a  card  by  plugging  the  columns  together.  In  this  caso  we 
wish  to  compute  PUGUH  where  P,  G,  H,  are  all  on  the 
same  card.  Assuming  that  the  information  has  been  singly 
punched  and  that  P  is  in  column  1,  G  is  in  column  5 
and  H  in  column  7,  all,  soy,  in  Row  1^  We  may  then 
pli’.g  the  Reading  brushes  Columns  1,  3  and  7  together  on  to 
say,  Punch  Kagnot  Column  1#  We  reproduce  the  pack  on  to  a 
blank  pack,  at  tho  same  timo  reproducing  an  identification 
number  if  reguirod,  Tho  rosult  is  to  obtain  a  seoond  pack 
of  cards,  in  one  to  one  correspondence  with  our  original 


70 


data  pack,  for  whioh  any  card  with  a  not-hole  In  Column  1 

mm  mm  am 

•  corresponds  to  a  data  card  with  F^G^H  =0  or  P^gOh  »l. 
Thus  each  card  with  tho  desired  property  will  be  identified 
by  a  corresponding  card  with  a  not-hole  in  Column  1, 

Alternatively  by  using  the  automatic  error  stop 
facility  on  the  machine,  wo  may  cause  the  machine  to  stop 
whonovor  PUG  JH  =0  and  then  withdraw  tho  card  from  tho 
hopper.  This  may  bo  done  by  plugging  the  columns  to  one 
side  of  a  comparing  relay,  and  ^ha  other  side  of  that  relay 
to  a  '1*  emitter.  An  ’O'  output  will  then  appear  as  an 
error  to  tho  machine. 

It  is  worth  noting  that  we  may  compute  up  to  80 
functions  of  this  type  simultaneously:  for  wo  can  compute, 
say,  FUGUH  =0  on  column  1,  FJG  =»0  on  column  2, 

mm  mm  mm  ~  ~ 

q  JH  *Jj  s»o  on  column  3  and  so  on. 

Wo  may  also  use  a  similar  procedure  to  solve  problem 
(3)(b),  i,e,  to  find  all  cords  which  have  only  tho  proper¬ 
ties  Ffi  on  H,  If  tho  total  set  of  properties  is 
A,B,C,D,E,F,G,H,  tho  function  we  require  to  compute  is 

aHeO  c  nDnEn  FOGH  H  =1  or 
AUBUC  'JDUBUFUGUH  =0  , 


Hero  it  is  necessary  to  have  the  information  concerning 
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those  properties  punched  both  as  p  and  as  p,  as 

e,g,  in  row  1  and  p  in  row  6,  After  reproducing,  our 
correspondence  cards  will  have  a  not-hole  in  row  6  for  all 
those  cards  with  MH  **C  and  a  not-hole  in  row  1  for 

all  those  cards  with  AUBUCUdUe  =0.  We  can  then  ex¬ 
tract  all  those  cards  for  which  the  correspondence  cards 
have  a  not-hole  in  both  row  1  and  row  6, 

It  is  clear  that  any  problem  of  retrieving  information 
can  be  coded  soi  that  the  answor  will  consist  in  producing 
cards  which  have  a  specifiable  relation  to  a  given  function 
of  that  information:  usually  "Are  thoro  any  cards  satis¬ 
fying  a  given  function  of  the  properties"?  or  ‘'How  many 
are  there"?.  Certainly  in  the  case  of  stock  retrieval,  or 
the  soloction  of  personnel,  or  the  selection  of  a  card  con¬ 
taining  the  birth  record  of  a  particular  individual  -  and 
those  are  throe  problems  which  punched  card  machinery  is 
being  usod  to  solve  at  present  -  the  usual  machine  opera¬ 
tion  is  performed  by  sorting:  this  is  not  only  slow  but 
highly  uneconomic.  The  Reproducing  punch,  if  the  informa¬ 
tion  bo  punched  to  include  complements,  or  modified  as 
suggested,  is  obviously  very  much  faster  and  more  economic. 
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SRCTION  IV 


BRACKETTING  WITH  PUNCHED-OARD  EQUIPMENT 

1,  The  punohed-card  operations  for  syntactical  sentenoe 
bracketting  described  here  have  their  theoretical  foundation 
in  the  work  of  A,F.  Parke r-Rh odes  and  their  immediate 
justification  in  the  algorithms  derived  by  D.  S.  Linney  and 
R.  McKinnon  Wood,  The  work  was  done  on  a  Hollerith 
duplicating  punch  using  two  paaks  of  car^s  punched  with 
Governor-Dependent  and  Word-order  information  respectively 
for  each  word  of  the  sentences  examined.  The  former  was 
obtained  from  the  Governor-Dependent  information  provided 
for  the  words  of  the  Longmans -Green  Defining  Vocabulary  by 
Y.A,  Wilks  and  E,  Rigby,  the  latter  was  provided  by  D,  S, 
Linney,  The  sentences  themselves  were  taken  from  a 
Iwngman s-Green  elementary  English  story-book  (''Tales  from 
the  East")  based  on  the  Vocabulary,  This  paper  describes 
attempts  to  obtain  one  satisfactory  bracketting  of  each 
sentence.  For  this  bracketting,  i,e,  this  particular 
distribution  of  parenthesis  signs,  it  is  a  necessary,  but 
not  a  sufficient,  condition  for  the  formation  of  any 
particular  bracket  that  the  set  of  contiguous  substituents 
under  consideration  can  form  a  valid  bracket.  That  is  to 
say,  all  the  attempts,  starting  at  the  right-hand  end  of 
the  sentenoe,  aim  to  looate  the  longest  possible  brackets. 
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The  results  are  used  to  compare  two  "strategies" ;  one .  the 
straightforward  performance  of  the  Governor-dependent  and 
Word-order  algorithms  simultaneously  for  each  pair  of 
substituents  considered  ("in  parallel")  and,  two,  a  strategy 
using  G.D,  in  a  normal  and  word-order  information  in  a 
restricted  form,  together  with  certain  other  rules,  in  an 
endeavour  to  avoid  some  of  the  card  storage  problems  arising 
with  the  other  methods. 

The  latter  method  is  outlined  and  accompanied  by  a 
partial  formalization  in  terms  of  a  flow-chart  and  a  "manual 
program"  in  an  adapted  form  of  the  Parker-Rhodes 
Algorithmic  Notation  (q,v, ),  This  is  preceded  by  (2)  an 
outline  of  the  forms  of  presentation  of  the  syntactical 
information  on  the  cards,  and  of  the  performance  of  the 
algorithms  with  the  cards  so  punched;  (3)  a  description  of 
the  notation  used  to  display  the  bracketting  process,  and 
(U)  a  description  of  the  preliminary  operation  of  checking 
the  dictionary.  This  last  consisted  in  providing  a  satis¬ 
factory  intuitive  bracketting  of  each  sentence  and  then 
operating  with  Govern  or- dependent  and  word-order  cards  in 
such  a  way  as  to  ensure  that  the  information  punohed  on  them 
at  no  stage  exluded  this  bracketting  in  its  correct  substitu¬ 
ent  type.  Were  this  not  done  a  mistake  in  the  compilation  of 
the  dictionary  entry  for  a  word  oould  make  all  operations 
with  that  word  useless. 
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Tho  centre  portion  of  columns  (28  -  53)  was  allotted 
to  the  punched  code  for  the  name  of  the  word.  The  columns 
(2  -  27)  on  the  left-hand  side  were  divided  into  adjacent 
pairs  from  the  left,  i,e»  2  and  3,  4  and  5  eto.  and  one 
such  pair  was  taken  to  represent  the  behaviour  of  the  word 
in  one  of  the  13  substituent  types  considered  for  English, 
i.e,  the  syntactical  tehaviour  of  the  word  (as  Governor 
or  dependent  in  the  one  case,  as  initial  or  final  in  order 
in  the  other)  in  substituent  type  4  is  recorded  in  columns 
10  and  11,  the  types  being  considered  in  the  order;  conjunct¬ 
ion,  one,  two, twelve. 


For  both  types  of  information  there  are  four  possible 
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entries  in  binary  arithmetic  for  a  word  in  any  given 
substituent  type;  00,  01,  10,  11,  The  first  digit  of  the 

2  digit  figure  is  recorded  with  a  hole  in  the  numerically 
lower  of  the  two  columns,  the  second  digit  is  similarly 
recorded  in  the  next  column  jy2<|  the  column  on  the  right- 
hand  side  whose  number  is  (82  -  number  of  the  relevant  column 
on  the  left-hand  side  of  the  card)  -  this  is  the  punohing 
procedure  for  Governor-dependent  information:  the  Word- 
order  information  has  its  second  digit  punched  only  in  the 
column  on  the  right-hand  side  of  the  card.  In  each  case  if 
the  digit  is  a  ”1"  a  hole  is  punched  in  row  1,  if  it  is  a  "O" 
a  hole  is  punched  in  row  6, 

It  will  be  evident  that  we  have  the  possibility  of  two 
'•dimensions  of  comparisob"  between  cards;  upper-lower  and 
right-left  of  the  card  faces,  D, S,  Linney  and  R,  McKinnon 
Wood  have  investigated  the  potentialities  of  computing 
logical  sums  and  products  by  the  method,  and  have  shown  that 
the  Boolean  algorithms  for  both  Govern or- dependent  and  Word- 
order  algorithms  can  be  given  for  a  pair  of  substituents  by 

3  and  4  card  operations  each,  using  only  the  right-left 
hand  reversibility;  viz,  for  a  pair  of  substituents  A,  M 
occurring  in  that  order  in  a  sentence. 
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i)  Qovernor-dependent  algorithm 

1.  Take  intersection  of  left-hand  side  card  (A)  and 

left-hand  side  of  card  (id)  backwards  on 
left-hand  side  of  card  (C) 

2.  Take  intersection  of  left-hand  side  of  card  (M) 

and  left-hand  side  of  card  (A)  backwards  on 
left-hand  side  of  card  (C) 

3.  Take  intersection  of  left-hand  side  of  card  (A) 

backwards  and  left-hand  side  of  card  (M) 
backwards  on  left-hand  side  of  card  (C) 
backwards. 

Note  that  since  the  same  Part  of  the  same  output  card 
is  used  for  2  and  3  wo  have  in  effect  produced  a  union  of 
intersections. 

ii)  Word-order  algorithm 

1,  Take  union  cf  l.h.s,  of  card  (A)  and  l.h,  s.  of 

card  (id)  on  l.h,  s,  of  card  (c) 

2,  "  union  of  l.h.s.  of  card  (A)  backwards  of  card 

(M)  on  l.h.s,  backwards 

3,  "  union  of  l.h.s.  of  card  (A)  backwards  of  card 

(M)  backwards,  backwards. 

In  each  case  the  output  card  C  indicates  the  syntact¬ 
ical  properties  of  the  combination  of  A  and  M,  .its  inter¬ 
pretation  is  given  on  the  reproductions  of  cards  (see  pp,  6-7) • 


The  algorithms  are  recursive  and  in  each  case  the  out¬ 
put  card  can  be  used  as  a  substituent  to  determine  the  proper¬ 
ties  of  the  group  A,  id  and  the  next  word  occurring  in 
reverse  sentence  order. 
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The  participation  class  entries  for  eaoh  of  the  13 
substituent  typos  for  English  are  recorded  separately  for 
Governor-dependent  (GD)  and  Word-order  (WO)  information 
on  what  are  called  'Master-cards',  indicated  by  type  and 
variety  of  information  thus  M(GD)7  *  "the  card  bearing 
the  G-D  participation  class  entry  for  any  substituent  of 
type  7". 
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3.  The  words  of  the  sentence  are  written  down  the  l.h,  c» 
of  the  page  in  normal  spoken  ordef.  The  bracks tting  operat¬ 
ions  begin  with  the  two  lowest  words  in  this  order  and  work 
vertically  up  the  sentence.  The  brackets  are  drawn  to  indi¬ 
cate  that  the  words  within  them  are  being  considered  as  a 
group.  Their  bracketting  possibilities  as  indicated  by  the 
output  card  for  the  group  of  words  concerned  are  shown  by  a 
series  of  numbers  vertically  in  numerical  order  from  the  top. 
When  on  the  1, h.sc  of  the  bracket  they  indicate  G-D  possibil¬ 
ities,  when  on  the  r.  h.s. ,  W-0  possibilities.  Numbers  not  in 
parentheses  indicate  the  types  within  which  the  group  forms  a 
valid  bracket,  those  in  parentheses  show  the  types  with  which 
the  group  could  form  a  valid  bracket  given  certain  properties 
of  words  yet  to  be  considered. 

Groups  of  words  linked  by  a  mark  ^  have  had  a  master¬ 
card  substituted  for  them  of  the  type  indicated.  This  has 
been  bracketted  as  a  substituent  within  the  bracket  immediately 
to  its  right. 

The  symbol  ^  indicates  that  it  is  from  this  point 
upwards  that  the  bracketting  is  being  done  to  the  right  of 
the  mark  as  far  as  the  next  sign  of  this  type, 

|  Heavy  signs,  as  left,  indloate  master-cards  repre¬ 
senting  the  bracketting  together  of  master-oards  themselves. 
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This  Is  normally  done  when  the  beginning  of  the  sentence  has 
been  reached,  and  wo  turn  to  consider  how  its  longer  substi¬ 
tuents  bracket  together. 

The  "vanishing”  of  substituent  numbers  on  the  way  up  the 
sentence  indicates  the  diminution  of  bracketting  possibili¬ 
ties  as  more  words  are  considered. 

The  sign  'X'  indicates  that  an  output  card  is  blank 
i.e,  that  the  bracket  group  is  certainly  invalid,  in  other 
words  that  the  bracket  is  impossible,  that  is  to  say  it  is 
not  a  true  group  at  all.  After  such  a  sign  the  next  operation 
(substitution  of  a  master-card,  or  re-bracketting  from  the 
last  y  sign)  is  recorded  by  a  bracket  drawn  parallel  to 
the  last  but  to  the  right  of  it.  The  sign  #  at  the  head 
of  a  bracket  indicates  the  application  of  iv)  rule,  (see 
P,88  ). 

The  sign  S  by  a  •'master-bracket”  indicates  that 
this  is,  temporarily  at  least,  stored  as  the  bracketting  of 
the  sentence  up  to  that  point.  The  bracketting  finally 
obtained  by  the  method  used  is  written  at  the  bottom  of  each 
sentence-sheet.  The  sign  f  denotes  application  of  the 
rule  v)  (see  P,  ), 
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4.  (  (  A  B  )(  (  0  D)  S  )  )  ) 

12,11  7,6 

The  die tionary-cheo king  consists  in  bracketting  the 
sentence  using  the  algorithms  in  the  knowledge  of  the  "correct" 
answer.  This  latter  is  provided  by  hand  and  displayed  as 
above,  the  type  of  the  brackets  being  written  under  the  first 
parentheses.  We  take  the  intuitively  "innermost*  bracket 
and  ensure  that  the  G-D  output  oard  for  its  constituents 
"permits"  a  bracketting  in  type  6  (i.e.  has  a  hole  punched 
in  row  1,  column  14) »  Wo  then  immediately  substitute 
MC ( GD) 6  and  ensure  that  this  can,  with  card  (E)  form  a 
bracket  in  type  7»  we  then  ensure  that  A  and  B  together 
bracket  in  type  11,  and  finally  ensure  that  MC(GD)ll, 

M0(GD)7  together  yield  a  card  permitting  bracketting  in 
type  12  -  i.e,  that  this  internal  bracketting  is  consist¬ 
ent  with  tho  whole  standing  as  a  sentence  or  "free  clause". 

This  sequence  of  operations  is  then  repeated  with  the  W-0 
cards. 


If  the  result  is  tho  same  as  the  one  sought  and  obtain¬ 
ed  with  tho  GD  cards  we  assert  that  the  intuitive  bracketting 
of  the  sentenco  is  at  no  point  excluded  by  the  information  in 
tho  two  syntactic  dictionaries. 


The  sentences  below,  from  Longman s-Gre an  "Tales  from 
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the  Bast0  p,  26  are  numbered  in  order  from  1  -  10,  The 
intuitive  braokatting  of  each  is  reproduced  at  the  head  of 
the  sheets  bearing  the  braokatting  results  for  that  sentenoe. 


(1)  "In  the  northern  part  of  Penang  there  is  a  little  stone 

house  near  the  sea, 

(2)  Near  the  little  stone  house  there  is  a  hut, 

(3)  The  little  stone  house  is  there. 

(4)  The  peoplo  call  it  Mat's  hut. 

(5)  Thoy  toll  the  story  of  Ikan  Dyong. 

(6)  He  had  two  sons  by  his  first  wife, 

(7)  His  second  wife  was  ycung  and  very  beautiful. 

(8)  She  had  blue  eyes  blue  like  the  sea, 

(9)  She  had  a  son  Halim. 

(10)  His  eyes  were  blue  like  tho  morning  sky”* 

5.  In  general  the  bracketting  procedure  is  applied  accord¬ 
ing  to  the  following  rules:- 

1,  continue  generating  output  cards  until  a  blank 
card  is  obtained 

2,  when  this  happens  examine  the  previous  output  card 

and  substitute  tho  master-card  corresponding  to 
the  numerically  lowest  substituent  type  'admitted' 
by  that  card. 

3,  If  bracketting  cannot  continue  with  this  card  substi¬ 

tute  the  next  'permitted'  mastercard  in  numerical 
order. 
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lj.  If  all  this  fails  examine  the  previous  output-card 
and  repeat  the  above  procedure. 

5.  If  all  this  fails,  even  for  the  output-card  for  the 

first  two  substituents  examined,  re-braoket  the 
sentence  beginning  with  the  last  word  but  one  of 
tho  string  of  words  then  under  consideration: 
(repeat  if  necessary), 

6.  When  doing  this  retain  tho  permitted  bracketting 

up  to  tho  point  from  which  the  bracketting  begins, 

7.  Cn  reaching  the  beginning  of  the  sentence  repeat  the 

whole  process  with  the  substituents  now  obtained 
(tho  "secd>nd-lovel  bracketting)  to  achieve  a  final 
permitted  bracketting  in  typo  12, 


In  strategy  I,  the  implicit  requirement  of  1  -  7  above 
(i.o,  that  wo  should  obtain  tho  longest  possible  bracket)  is 
modified  using  word  order  information  in  a  restricted  manner 
to  reduce  tho  amount  of  "breaking-back"  of  5  above.  Unless 
at  loast  this  is  done  the  sentence  will  frequently  not 
bracket  at  all. 


Conjunctions  are  troatod  as  follows:  mark  on  both 
sides  of  the  conjunction  (determined  by  a  given  list  of 
conjunctions)  tho  smallest  self-contained  bracket.  Endeavour 
to  bracket  both  parts  separately  in  tho  form  of  master-cards 
(unloss  the  substituent  is  a  single  word)  with  the  conjunct¬ 
ion  in  type  C„  If  this  can  be  done  wo  have  a  C  bracket. 

Tho  output-card  for  this  group  for  further  use  is  bracketting 
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the  whole  sentonce  containing  the  conjunction  is  formed  from 
tho  logical  sum  of  the  master-cards  for  the  two  outer  parte 
of  tho  group* 

i.o,  "(young)  and  (very  beautiful)" 

Output-card  is  the  union  of  the  card  for  "young"  and 
MC (GD) 7  (for  "vory  beautiful")  for  GD  work  and  similarly 
for  WO. 

This  is  a  weaker  ouiinP  than  taking  the  intersection 
or  logical  product,  which  may  bo  more  fruitful  in  future 
work,  bocauso  for  instance  -  "cats  and  books"  is  not  a  verb. 


6,  a.Notos  on  Strategy  I. 

Tho  rulos  and  "program",  outlined  here  represent  an 
attompt  to  avoid  both  long-winded  1 back- trackings' ,  and  tho 
problems  of  choico  and  store  involved  in  using  WO  and  GD 
information  in  parallel.  It  is  true  that  tho  "stronger"  the 
information  usod  tho  loss  the  chance  of  a  'bad  choice'  being 
made.  But  what  is  presented  horo  uses  a  weaker  'binary*  form 
of  tho  WO  algorithm  (boo  infra)  in  the  hope  that  a  sacrifice 
of  strongth  may  pay  in  torras  of  simplicity  i.o,  by  avoiding 
problems  or  using  information  in  parallel  while  at  tho  earns 
time  looking  for  tho  longest  possible  bracket.  Tho  latter 
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strategy  would  “bo  all  too  simple  without  the  last  require¬ 
ment  -  i.e*  if  wo  merely  "made  the  appropriate  bracket" 
after  every  word*  This  would,  however,  lead  not  only  to 
redundancy  but,  in  the  case  of  sentences  whose  "deepost 
point"  was  not  at  the  end,  it  would  miSB  braokets  altogether. 

Tho  efforts  here  add  up  to  attempts  to  mark  boundaries 
{  before  beginning  tho  brackotting  proper  which  aro  observed 

when  going  'up  or  down'  tho  sentence,  and  is  thus  an  attempt 
to  preserve  information  in  two  different  ways. 

Wo  require  a)  a  success  routine  (i.e.  we  reach  the 
beginning  of  tho  sentence)  b)  two  limited  failure  routines 
i)  doos  not  bracket  till  after  "back-tracking",  ii)  does 
not  brackot  oven  aftor  "back-tracking",  c)  a  routine  for 
brackotting  master  substituent  together  after  accepting 
certain  mastors  and  rejecting  others,  c)  will  be  assumed 
(  to  bo  of  tho  form  ' a  +  b’ ,  since  substituents  are  not 

distinguished  by  their  length.  Wo  assume  further  that  the 
routinos  a,  b,  provide  only  one  unamiguous  brackotting 
of  tho  sontonco,  aftor  whoso  final  brackotting  by  c  the 
procedure  stops, 

Tho  strategy  difforB  from  tho  general  rules  given  in 


5  in  tho  following  waysi 
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i)  a  master  is  substituted  whenever  a  "break”  (see 
infra)  is  reached 

ii)  this  substitution  is  never  ’unmade'  when  subse¬ 
quently  back- tracking  i.e,  when  "moving  the  end"  of 
the  sentence  it  is  never  moved  to  a  point  within  the 
substituent,  but  to  the  last  "break"  passed 

iii) by  recording  the  brackatting  up  to  that  point  as 
a  substituent  to  bo  brackotted  later  (on  reaching  the 

end  of  the  sentence)  as  part  of  a  second-level -br&cfcetting, 

iv)  a  rule  *  that  whenever  an  output  card  has 
governor  holos  but  only  in  one  or  more  of  types  10,  11, 

12  wo  move  the  end  of  tho  sentence  >  to  above  the 
last  substituent  considered  and  continue  brackatting 
from  thoro,  whilo  storing  tho  output  so  far. 

v)  wo  includo  a  rule  that  if  the  'front'  of  the  sen¬ 
tence  is  roached  in  such  a  way  as  to  exhibit  the  whole 

sontonce  in  some  substituent  type  other  than  12  wo 

rojoct  that  type  (and  call  the  output  zero). 


This  last  rule  enables  us  to  deal  with  complex  subjeots 
and  phrases  whioh  precede  tho  subject  of  the  sentence  with¬ 
out  tedious  amounts  of  ' baol&- tracking ' .  In  some  sentences 
the  rulo  has  not  been  applied  in  order  to  compare  the  labour 
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involved  in  not  using  it.  (See  sentences  3»  8  and  10).  The 
orders  applying  it  in  tho  ’program’  can  bo  deleted  by  de¬ 
leting  orders  40  and  41  and  substituting  ’10’  for  '40'  in 
order  22. 

Tho  'breaks'  are  indicated  in  the  results  by  horizontal 
rod  linos.  They  aro  of  four  typos,  conjunction,  tho  begin¬ 
ning  of  the  sontonco,  (both  discussed  abovo),  binary  word 
order  and  propositions  (soo  infra). 

3inary  word  ordor  -  Wo  precede  tho  program  with  the 
application  of  tho  word  ordor  algorithm  to  all  contiguous 
pairs  of  words  in  tho  sontonco  and  mark  a  break  botween  tho 
members  of  those  pairs  tho  algorithm  rejects  entirely  (i.o. 
whore  a  'final'  word  procodos  an  'initial'  word  in  every 
substituent  typo  in  which  both  occur).  This  gives  a  neces¬ 
sary  and  sufficient  condition  that  a  bracket  bo  drawn  at 
thi«  point  i.e.  no  connection  rejected  in  this  way  oan  be 
roassertod  by  an  output  from  a  larger  brackot. 

Propositions  aro  marked  by  a  break  line  preceding 
them  in  sontonco  ordor  in  a  mannor  analogous  to  the  conjunct¬ 
ion  rulo.  Wo  assume  an  oxtonsional  list  of  propositional 
words  and  mark  thorn  so  only  if  they  aro  both  preceding  and 
followed  by  oithor  a  noun,  an  adjective  or  no  word  at  all. 

It  will  bo  soon  that  strategy  II,  'gots  out'  "preposition 
brackets"  (type  2)  more  easily  than  this. 
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6,  To.  Notation 

Consider  a  sentence  of  N  words  or  basio  substitu- 

onts  (i. e.  initial  units  with  listed  dictionary  entries) 

numbered  1  -  N  from  tho  beginning  of  a  sentence.  Word-order 

(WO)  breaks  occur  after  words  numbered  a,  b,  c,  (and  N, ). 

Tho  strategy  to  bo  outlined  provides  acceptable  words  - 

cy 

output  cards  of  a  form  f^|  ( ii  )  each  having  a  stored  G-D 
entry,  whero  ^ 

g  donotos  tho  number  of  this  unit  numbered  from 

the  back  of  tho  sentence  of  accepted  master  units, 

g  denotes  tho  position  number  of  tho  first  word  in 
tho  substituont  or  unit  (  1  i,  3  ^  N) 

h  denotes  tho  current  valuo  of  a  (see  below). 

At  any  moment  wo  deal  only  with  two  word-cards  from 
tho  first  n  words  of  N  after  the  word  whose  original 
place  number  is  tho  current  value  of  &  (initially  l) 

This  enables  us,  aftor  'reaching'  tho  beginning  of  the 
sontonco  from  any  point  'n*  to  return,  set  &  a  n  ,  Nsn 
(i.e,  the  new  'n')  and  brackot  up  to  'n*  from  N, 

Tho  two  cards  in  hand  at  any  time  are  tho  word  card 
-  (numborod  from  tho  beginning  of  tho  n  -  word  'sentence') 
and  tho  card  X^.  -  (representing  either  a  word  or  a  substi¬ 
tuent)  representing  tho  substituent  last  obtained  or  dealt 
with,  this  latter  is  therefore  tho  r+i  effective  substituent. 
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The  GD  brackotting  algorithm  (BA)  ia  performed  on 
those  two  cards  to  produce  the  output  card  X  »  (whioh  is 

r*  -I 

in  turn  compared  with  the  ►  -»  word  card)*  (r)  and  (x) 

will  refer  to  tho  cards  in  question*  Questions  like  (xj-  ? 

will  ask  whether  there  aro  any  govornor  or  dependent  holeB 

on  (X  )•  The  value  of  the  variable  y  at  any  time  refers  to 
\i 

tho  position  nurnbor  (  I  <  ^  £ u  )  of  tho  furthest  left  of 
tho  'unusod1  (i.e,  not  so  far  considered  by  tho  process) 
govornor  holos  on  Xr  -  the  set  of  them  being  labelled 
etc.  (  «*  <  p  <  tS  ),  Tho  symbol  1 G*  will  bo  taken  to 
indicate  tho  master-card  for  the  typo  y, 

S  donotos  a  'store-operation’. 

By  appropriate  re-assignment  of  variables  (not  given) 
when  n  =  0,  tho  "second-order  bracketting"  of  tho  'M' 

units  is  achiovod  by  the  samo  sorios  of  questions  and  orders. 

An  exclamation  mark  (,' )  indicates  an  order  which  is 
oxocutod  and  tho  noxt  order  executed  is  tho  one  whose  number 
occurs  in  tho  bracket  following  tho  exclamation  mark.  A 
question  mark  (?)  indicates  a  question.  If  tho  answer  is 
'yes'  tho  procedure  moves  to  tho  order  whoso  number  occurs 
first  in  the  brackot  following  tho  question  mark.  Conversely 
if  tho  answer  is  'no'  tho  procedure  moves  to  the  order  indi¬ 
cated  by  tho  second  digit  in  the  braokot, 

Tho  notation  is  a  simplified  form  of  the  P*rker-Rtaodes 


Algorithm  Notation, 
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6.  d,  Program 

1.  r(r;0)  0  (r^n^N 

2.  =  /X/ 

3.  =  /W 

4.  4  ■  * 

5.  (r+i)  =(  X,  r)  • 

6.  q  si  l 

7.  n«N.» 

8.  r^n-1 

9.  y=o<  1 

10.  rs  a%Ac  ?  (11,17) 

11.  (G.y)s(X.r)  *  S’ 

12.  ya«<  ?  (14,13) 

13.  y=/4?  (15,16) 

14.  y=(W  (  17  ) 

15.  y-  (  17  ) 

16.  ys.'  ^17) 

17.  BA  (  (r) ,  (X.  r)  )  l 

18.  h-n  .’ 

19.  (X.  r-l)»?  (20,21) 

20.  y-?  (23,11) 

21.  r=i4l  I 

22.  r=  4-+1  ?  (38,40) 

23.  rsa  v  (27,24) 

24.  r»b  ?  (28,25) 

25.  rac  ?  (29,26) 

26.  n.M-1  ?  (30,34) 

27.  n.a  .'  (31) 

28.  nab  !  (31) 

29.  nao  .'  (31) 

30.  n«N-i  .'  (31) 

31.  8  lij(h)  -X,  r 


32.  q.q+1  * 

33.  raaVoVl  ?  (8,7) 

34.  r*n-l  ?  (35,36) 

35.  n.n-1  .’  (31) 

36.  r»r-l  .* 

37.  unstora  S’  (31) 

38.  n«N  ?  (43,39) 

39.  4-1  ?(EXIT,  42) 

40.  y-lA2A3*4A5A6  JA8A9 

?  (10,41) 

41.  n*r  (31) 

42.  4  -n-1  .'  (31) 

.  A  b. 

43.  p>  o  »12t  (EXIT, 44) 

44.  r»l  .’ 

45.  (X, r-i)  •  !  (20) 


1227  2  12  11  7 

( (in(the  north  part (op  penang)))) (there (is (a 

LITTLE  STOKE  HOUSE  )  (NEAR  (THE  SEA))))) 


7 


NORTH 


PART 


PENANG 


THERE 


LITTLE 


STONE 


HOUSE 


NEAR 


((In  (the  north  part (of  Ponang) ) ) (thoro  is(a  little  stone  house ( 
2  7  2  12'  7  2 

noar(tho  sea))))) 

7 


95 


2.  ( (near (the  litti$  stone  house)) (there (is (a  hut))^) 

12  2  7  11  II 


((Noar(tho  littlo  stono  house) ) (there  is(a  hut))) 
12  2  7  12  7 
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3.  ((THE  LITTLE  STONE  HOUSE) ( IS  THERE)) 


12  7  11 


Y»(crr.k,  Y) 


Bracketting  displayed  after  t r 

line  shows  ' Lack-breaking' 
procoduro  had  wo  not  applied  x 
rulo  at  X. 

i)  ((The  little  stone  house) (is  there)) 
12-7  11 

ii)  as  for  i) 
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k,  (  (THE  PEOPLE) (CALL  IT  (MATS  HUT))) 
12  7  11  7 


x  *-  indicates  that  rule  *  was  triad  at  this 
and  failod, 

((The  people) (call( it  Mats  hut))). 

12  7  11  7 
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5.  (THEY (TELL(THE  STORY(OF  IKAN  -  DYONG)))) 
12  11  7  2 

*■ 

— 

THEY  >2 

TELL 


THE 

STORY 


OP  2\ 

6 

# 

I  KAN- DYONG  13 


(They  toll(the  story(of  Ikan-Dyong) ) ) 
12  7  2 
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6.  (HE(HAD(TW0  S0NS(BY(HI8  first  wifi))))) 


12  11  7  2  7 

HE 

HAD 

TWO 

SONS  ^ 

"t - * 

I 

! 

->j-  - 

>i 

0 

m  a 

...  .  — » 

HIS 

FIRST  *7 

WIFE 

— 

• 

i  __ 

(no (had  two  sons  (*y  (his  first  wife)))) 
12  11  27 


Application  of  tho  rula  *  is  trivial  hero,  tho 
procedure  would  havo  worked  without  it  on 
substitution  of  an  'Il-tnastor* , 
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8.  (SHE  (HAD (BLUE  EYES) (BLUE (LIKE (THE  SEA))))) 
1  2  11  5  2  7 


(Sha (had  blue  eyes  bluo (like (the  sea)))) 
12  11  2  7 


?  x 


rule  cculd  have  been  applied  trivially  here,  as 
in  sontonoo  6,  -  it  makes  no  difference* 
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10,  (  (HIS  EYES) (WERE(BLUE(LIKE(THE  MORNING  SKY))))) 

12  7  11  5  2  7 

i 


(i)  (  (  His  eyoa)(woro  blue (like (the  morning  sky)))) 

12  7  31  2  7 

(ii)  (  (  His  oyos)(  (woro  blue)  (like  (the  morning  sky)))) 

12  ?  111®  2  7 


The  arrows  ( t ) 
at  that  point. 


indicate  a  repeat  (not  shown)  of  the  cycle  X-Y 


Notes  on  Strategy  II 


1*  Output  cards  aro  generated  at  the  same  time  for 
each  pair  of  substituents  considered  for  both  GD 
and  WO  information, 

2,  If  either  typo  of  information  yiolds  a  blank  card 
a  substitution  of  mastors  is  made  at  that  point  for 
both  typos  of  information, 

3,  The  master  chosen  is  always  that  of  the  substitu¬ 
ent  type  of  lowest  number  occurring  in  the  inter¬ 
section  of  the  sots  of  numbers  of  the  substituent- 
typos  permitted  by  the  two  output  cards  generated 

at  that  point.  The  sets  aro  of  the  "distinct  possi¬ 
bilities  of  association  at  this  point"  only  i.e, 
not  those  that  roquiro  us  to  look  at  words  not  so 
far  considered, 

4,  The  WO  results  at  each  stage  aro  written  on  tho 
right  of  tho  "brackotting  line",  the  GD  on  tho  loft, 

5,  If  tho  Intersection  of  total  possibilities  at  any 
stage  (i.e,  not  tho  one  specified  in  (3))  is  zero, 
wo  treat  it  as  wo  would  tho  appoarance  of  a  blank 
card  for  either  typo  of  information. 
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6,  Tho  rulo  x  is  applied  to  avoid  "back-tracking 
(as  in  I.)  in  this  form*  —  -  if  the  intersection 
(specified  in  3)  consists  only  of  one  or  more  of 
tho  types  10,  11,  12  wo  continuo  bracketting  the 
sontonco  starting  from  above  the  last  word  substi¬ 
tuent  considered*  At  tho  same  time  wo  store  the 
bracketting  of  tho  sentonco  up  to  this  point  so  far 
obtained, 

7,  The  conjunction  rulo  is  applied  as  in  I,,  and 
thero  is  no  proposition  rule.  Other  notation  is  as 
in  I, 
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:  12  2  7  2  12  11  7 

*  ( (in(the  NORTH  PART(OF  PENANG))) (THERE (IS  ( A 

„  LITTLE  STONE  HOUSE)  (NEAR  (THE  SEA))))) 
q faults  II,  2  7 


((in  (tho  north  part  (of  Penang)  (there  is  (a  little  stone 
12  2  7  2  12  7 

house  (  near  (  the  sea))) 

2  7 


108. 


3..  ((THE  LITTLE  STONE  HOUSE) 
12  7 


THE 

LITTLE 

STONE 

HOUSE 

IS 

THERE 


1 


(IS  THERE)) 
11 


(  (  The  little  stone  house) (is  thoro)) 
12  7  11 
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U.  ((THE  PEOPLE  )  (CALL  IT  (MATS  HUT))) 
12  7  11  7 


- 

l 

1 

i- 

!• 

THE 

7 

r 

j 

j 

PEOPLE 

■ 

j 

l 

t 

i 

CALL 


IT 


7* 


MATS  7 


HUT 


7 


7 


it 


/fe 


(  (  Tho  pcoplo)  (call  (it  Mat's  hut))) 
12  7  lx  7 


1 
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5.  (they (tell (ties  story(op  ikan-dyong) ) ) ) 

12  11  7  2 


( 


( 

V 


(ThQy(toll(tho  story  (of  Ikan-Jtyong)) ) ) 
12  11  7  2 


Ill 


6.  (HE(HAD(TWO  SONS (BY (HIS  FIRST  WIFE))))) 
12  11  7  2  7 


,2 


7.  ( (HIS  SECOND  WIFE)  (WAS  (YOUNG  AND(Vi£RY  BEAUTIFUL)))) 

12  7  10  0  5 


(ii)  examines  the  bracketting  not  using  the  10-11-12 
rule,  (It  assumes  that  the  'CT  bracket 
constitutes  a  unit  which  cannot  be  'unmade'). 


(1) 

a 

/?,\  (  (  His  socond  wife)  (was(young  and(very  beautiful)))) 

'xx;  12  7  10  C  3 
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8.  (SHE  (HAD (BLUE  EYES)  (BLUE (LIKE (THE  SEA))))) 
12  11  7  5  2  7 

* 


(  Sho  (had  blue  eyes  blue  (like  (the  sea)))) 
12  11  27 


(HIS  EYES)  (V/EHE( BLUE  (LIKE  (THE  MORNING  SKY))))) 
.2  7  11  5  2  7 


ora(bluo (like(tho  morning  sky))))) 

5  2  7 
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t  n  CONCLUSION 

It  will  bo  sean  by  comparing  tho  proportions  of 
"success  brackets"  obtained  by  strategies  I  and  II  that 
II  is  better  in  two  ways: 

1)  it  requires  pre-editing  only  for  conjunctions, 

2)  tho  rosults  displayed  in  procoding  pages  show  that  II 
"misses"  fivo  brackets  (of  which  one  is  a  plausible 
alternative  brackotting) ,  while  I  "misses  nine  (of 
which  two  are  plausible  alternative  bracke t tings) . 

I  think  it  has  been  demonstrated  hero  that  tho 
decroaso  in  back- tracking  obtained  amply  Justifies  the  use 
of  some  form  of  tho  "10-11-12"  rule.  (See  p,  88  (iv)). 
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SUMMARY  CP  CODING 


List  of  Substituent  Types 

1.  Adverbial  Group 

2.  Adverbial  O-Clause  (also  funotlons  as  adjunotive) 

3.  Adverbial  S-Clause  (also  funotlons  as  nominal) 

4.  Adverbial  Subjunot 

5.  Adjunotive  Group 

6.  Adjunotive  S-Clause 

7.  Nominal  Group 

8.  Nominal  O-Clause  (also  has  adverbial  &  adjunotive  uses) 

9.  Nominal  Subjunot 

10,  Operative  Group 

11,  Operative  S-Clause  or  Predloate 

12,  Full  Clause 


Definition  of  Coding  Digits 


v 


Word -Order 


1  ;  oan  be  initial,  but  may  not  follow  a  w2n 

2  ■  oan  be  final  and  may  not  preoede  a  nln 

3  i  oan  ooo ur  anywhere 
0  *  oannot  ooour  at  all 


Conjunction 

:  1 

m 

m 

oan  be  joined  by  a  oonjunotion 

2 

9 

oan  act  as  a  oonjunotion  (narrow  sense 

3 

m 

oan  both  join  and  be  joined 

0 

s 

oannot  be  Joined  by  a  oonjunotion 

Dependency 

J  1 

= 

oan  be  dependent  only 

2 

X 

oan  be  governor  only 

3 

9 

oan  be  either 

0 

S 

oannot  ooour  at  all 

1. 


SYNTAX  DICTIONARY 


Revised  June  1962 


Word  A.L.Nt  — — — — Part ioipat ion 

“  Word -Order  ConJ  Dependency 


A 

0000 

abstract 

0300 

abyss 

0000 

accomplished 

0002 

aooordanoe 

0200 

activities 

0200 

adopted 

0002 

adored 

0002 

age-old 

0002 

ages 

0220 

agriculture 

0200 

all 

0000 

allows 

0000 

alphabets 

0200 

also 

1000 

amount 

0220 

an 

0000 

analysis 

0200 

animal 

0002 

another 

0202 

anthoropology 

0200 

any 

0202 

apologize 

0020 

approaohed 

0022 

are 

0220 

Aristotle 

0200 

ana 

0020 

articles 

0200 

as 

0101 

asked 

0022 

attempts 

0200 

authentlo 

0002 

authority 

0200 

away 

3002 

6 

0200 

bad 

0002 

ball 

0020 

banana. 

0000 

00 

100 

000 

1 

0000 

00  100 

000 

10 

300 

230 

1 

0100 

20  300 

320 

00 

300 

000 

1 

0000 

00  200 

000 

30 

100 

230 

1 

0001 

20 

100 

320 

00 

320 

021 

0 

0100 

00 

210 

Oil 

00 

320 

021 

1 

0100 

00 

210 

Oil 

30 

100 

230 

1 

0001 

20  100 

320 

30 

100 

230 

1 

0001 

20 

100 

320 

30 

100 

020 

1 

0001 

20  100 

010 

02 

300 

333 

1 

0120 

02 

200 

333 

00 

320 

021 

1 

0100 

00 

210 

Oil 

30 

100 

020 

1 

0000 

20 

100 

010 

00 

000 

210 

1 

0000 

00 

000 

220 

00 

320 

021 

1 

0100 

00  210 

Oil 

10 

000 

113 

3 

1000 

10 

000 

111 

02 

200 

312 

1 

0120 

02 

200 

320 

00 

100 

000 

1 

0000 

00 

100 

000 

00 

320 

021 

1 

0100 

00 

210 

Oil 

30 

200 

020 

1 

0001 

20  300 

010 

00 

130 

021 

1 

0101 

00 

110 

Oil 

00 

320 

021 

1 

0100 

00  210 

Oil 

00 

130 

021 

1 

0101 

00  110 

Oil 

02 

020 

212 

1 

0020 

02 

020 

322 

32 

100 

232 

1 

0021 

22 

100 

322 

02 

320 

333 

1 

0120 

02 

210 

232 

00 

320 

021 

1 

0100 

00  210 

Oil 

02 

200 

310 

1 

0020 

02 

200 

320 

00 

320 

021 

1 

0100 

00 

210 

Oil 

10 

000 

000 

0 

0202 

10  000 

000 

32 

100 

232 

1 

0021 

22 

100 

322 

00 

300 

211 

1 

0100 

00 

200 

333 

30 

100 

020 

1 

0001 

20 

100 

010 

00 

320 

021 

1 

0100 

00  210 

Oil 

00 

000 

220 

1 

2001 

00 

000 

110 

00 

320 

021 

1 

0100 

00  210 

Oil 

30 

100 

020 

1 

0001 

20  100 

010 

02 

200 

310 

1 

0020 

02 

200 

320 

10 

300 

000 

1 

0000 

10  200 

000 

2. 


I 


( 


( 


be 

0000 

00 

020 

200 

1 

0000  00  020  200 

beast 

0000 

10 

300 

000 

1 

0000  00  200  000 

been 

0000 

00 

000 

200 

1 

0000  00  000  200 

bilingual 

blaolcberry 

0002 

30 

100 

020 

1 

0001  20  100  010 

0200 

10 

300 

000 

1 

0100  10  200  000 

blood 

0200 

10 

320 

021 

1 

01C0  10  210  011 

bodily 

3002 

30 

100 

121 

1 

3001  30  100  111 

bones 

0200 

00 

320 

021 

1 

0100  00  210  011 

bore 

0000 

00 

200 

310 

1 

0000  00  200  320 

bom 

0002 

30 

200 

220 

1 

0001  20  100  120 

breath 

0200 

00 

320 

021 

1 

0100  00  210  011 

breed 

0020 

02 

200 

310 

1 

0020  02  200  320 

bridge 

0000 

00 

200 

310 

1 

0000  00  200  320 

brightness 

0200 

00 

320 

021 

1 

0100  00  210  011 

brllllanoe 

0200 

00 

320 

021 

1 

0100  00  210  011 

burnt-offering 

0000 

00 

300 

000 

1 

OOOC  00  200  000 

but 

0000 

00 

000 

000 

2 

0000  00  000  000 

buy 

0000 

00 

200 

310 

1 

0000  00  200  320 

by 

3120 

02 

020 

220 

1 

2210  01  010  110 

CALL 

0220 

02 

200 

312 

1 

0120  02  200  320 

Cambridge 

0200 

20 

320 

CGI 

1 

0100  20  210  001 

oan 

0220 

02 

320 

333 

1 

0120  02  210  232 

oandldates 

0200 

00 

320 

021 

1 

0100  00  210  011 

oannot 

0020 

02 

000 

312 

1 

0020  02  000  222 

oathollo 

0002 

30 

300 

020 

1 

0001  20  300  010 

ohapter 

0000 

00 

300 

000 

1 

0000  00  200  000 

oharged 

0002 

30 

100 

230 

1 

0001  20  100  320 

olassloal 

0002 

30 

100 

020 

1 

0001  20  100  010 

classification 

0200 

00 

320 

021 

1 

0100  00  210  011 

olear 

0002 

10 

120 

230 

1 

0001  20  120  322 

o lusters 

0220 

02 

300 

333 

1 

0120  02  300  333 

oold 

0202 

30 

320 

021 

1 

0101  20  310  011 

oo lie ot Ion 

0200 

00 

320 

021 

1 

010C  00  21 C  Oil 

oomes 

0020 

02 

000 

212 

1 

0020  02  OCO  222 

oommenoed 

CC22 

32 

100 

232 

1 

0021  22  100  322 

oonmunlon 

0200 

00 

320 

021 

1 

0100  00  210  011 

oonoluslon 

0200 

00 

320 

021 

1 

0100  00  210  011 

oondltlon 

0200 

00 

320 

331 

1 

0100  00  310  331 

confusions 

0200 

oc 

320 

021 

1 

0100  00  210  011 

conjectural 

0002 

30 

100 

020 

1 

0001  20  100  010 

oonneot 

0020 

02 

020 

212 

1 

0020  02  020  322 

oonneotlon 

0200 

00 

320 

021 

1 

0100  00  210  011 

connotation 

0200 

00 

320 

021 

1 

0100  00  210  011 

oonseorated 

0002 

30 

100 

230 

1 

0001  20  100  320 

contained 

0002 

30 

100 

230 

1 

0001  20  100  320 

oontenot 

context 

0200 

00 

320 

021 

1 

0100  00  210  011 

0200 

00 

320 

021 

1 

0100  00  210  011 

contrasted 

0022 

32 

100 

232 

1 

0021  22  100  322 

i 


3. 


( 


( 


contribution 

0200 

X 

oonvlnoing 

CIO:? 

30 

could 

0C2O 

02 

oourse 

0220 

02 

Cypriot 

0002 

30 

Cseoh 

0002 

30 

DEATH 

0200 

30 

decipherment 

0200 

00 

declaring 

0100 

X 

deems 

0000 

X 

definition 

0200 

X 

denied 

0002 

30 

denotation 

0200 

X 

department 

0200 

20 

describes 

OOCC 

X 

description 

0200 

X 

destruction 

0200 

00 

detail 

0200 

X 

determine 

OOCC 

00 

different 

0002 

30 

differently 

3000 

10 

dim 

0000 

10 

discussed 

0022 

32 

dlsousslon 

0200 

00 

dispelled 

CC02 

30 

diverge 

0020 

02 

divinity 

0200 

00 

division 

0200 

X 

do 

0020 

02 

doctrines 

0200 

X 

doing 

010C 

X 

domestication 

0200 

X 

down 

3302 

00 

dust 

02  00 

00 

dustbin 

0000 

X 

EIECTRICITY 

0200 

00 

eleotrosoope 

0X0 

00 

elements 

0200 

X 

else 

0021 

X 

emphasis 

0200 

X 

enjoy 

XOO 

X 

enough 

2002 

20 

enthusiastic 

X02 

30 

entire 

0000 

30 

establish 

0X0 

X 

everyday 

X02 

30 

evolutionary 

XX 

30 

320 

021 

1 

01X 

00 

210 

Oil 

130 

220 

1 

0201 

20 

120 

110 

xo 

312 

1 

0020 

X 

OX 

222 

200 

312 

1 

0120 

X 

200 

322 

300 

XO 

1 

XC1 

20 

3X 

010 

300 

020 

1 

0X1 

20 

300 

do 

320 

001 

1 

01X 

20 

310 

XI 

320 

XI 

1 

0100 

00 

210 

on 

210 

200 

1 

XX 

00 

220 

100 

XO 

210 

1 

XOO 

00 

XO 

220 

320 

021 

1 

0100 

X 

210 

Oil 

IX 

210 

1 

X01 

20 

IX 

320 

320 

021 

1 

01X 

X 

210 

Oil 

320 

001 

1 

01X 

20 

310 

XI 

XO 

210 

1 

XOO 

X 

000 

220 

320 

021 

1 

0100 

00 

210 

Oil 

320 

XI 

1 

0100 

X 

210 

Oil 

320 

331 

1 

01X 

X 

310 

331 

020 

210 

1 

0X0 

00 

XO 

320 

100 

XO 

1 

X01 

20 

IX 

010 

OX 

121 

1 

2X0 

10 

OX 

111 

120 

230 

1 

0X0 

20 

120 

322 

XO 

232 

1 

X21 

22 

000 

322 

320 

021 

1 

01X 
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symposiast 

system 

(  TAKE 
takes 
talk 
Tarpeia 
task 
tearing 
term 

terminologioal 

terminology 

text 

that 

the 

their 

them 

thereafter 

these 

things 


9. 


0202 

10 

320 

232 

1 

0101 

20 

310 

321 

0020 

02 

000 

112 

1 

0020 

02 

000 

222 

0202 

00 

320 

331 

1 

0101 

00 

210 

331 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

0202 

30 

320 

021 

1 

0101 

20 

310 

Oil 

0002 

30 

100 

220 

1 

0001 

20 

100 

120 

0002 

30 

200 

230 

1 

0001 

20 

100 

320 

0002 

30 

300 

020 

1 

0001 

20 

300 

010 

0002 

30 

200 

230 

1 

0001 

20 

100 

320 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

0000 

00 

200 

310 

1 

0000 

00 

200 

320 

0200 

00 

320 

331 

1 

0100 

00 

210 

331 

0020 

02 

020 

212 

1 

0020 

02 

020 

322 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

0000 

00 

300 

000 

1 

0000 

00 

200 

000 

0200 

00 

300 

211 

1 

0100 

00 

200 

333 

0022 

32 

100 

232 

1 

0021 

22 

100 

322 

0020 

02 

200 

310 

1 

0020 

02 

200 

320 

0000 

00 

300 

000 

1 

0000 

00 

200 

000 

0020 

02 

200 

310 

1 

0020 

02 

200 

320 

0220 

02 

300 

333 

1 

0120 

02 

300 

333 

0000 

00 

300 

000 

1 

0000 

00 

200 

000 

0202 

30 

300 

231 

1 

0101 

30 

300 

331 

0020 

02 

200 

310 

1 

0020 

02 

200 

320 

0200 

10 

200 

230 

1 

01  oc 

20 

200 

320 

0000 

30 

100 

020 

1 

0000 

20 

100 

010 

0202 

00 

100 

020 

1 

0101 

00 

100 

010 

0000 

00 

200 

000 

1 

0000 

00 

200 

000 

0020 

02 

020 

212 

1 

0020 

02 

020 

322 

0000 

00 

300 

200 

1 

0000 

00 

200 

000 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

0020 

02 

020 

212 

1 

0020 

02 

020 

322 

0020 

02 

000 

212 

1 

0020 

02 

000 

222 

0220 

02 

320 

333 

1 

0120 

02 

310 

333 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

0200 

00 

300 

000 

1 

0100 

00 

300 

000 

0100 

30 

130 

220 

1 

0200 

20 

120 

110 

0200 

00 

200 

310 

1 

0100 

00 

200 

320 

0002 

30 

100 

020 

1 

0001 

20 

100 

010 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

0203 

21 

121 

021 

1 

0103 

21 

512 

on 

0000 

00 

100 

000 

1 

0000 

00 

100 

COO 

0000 

00 

100 

000 

1 

0000 

00 

100 

000 

0200 

00 

120 

020 

1 

0100 

00 

210 

010 

2000 

30 

000 

123 

1 

2000 

10 

000 

111 

0200 

20 

120 

021 

1 

0100 

20 

310 

Oil 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

jig##?***/** 


10. 


c 


this 

thought 

through 

throwing 

tie 


tiny 

to 

too 

tooth 

topics 

torn 

toyed 

traditional 

translations 

tried 

truths 

two 


0200  20  120  021 
0220  32  320  233 
3122  02  020  220 
0302  00  310  223 
0020  02  200  310 
0002  30  100  020 
2100  00  010  220 
1000  10  000  002 
0000  00  300  000 
0200  00  320  021 
0002  30  000  220 
0020  02  000  212 
0002  30  100  020 
0200  00  320  021 
0022  32  100  232 
0200  00  320  021 
0202  3  0  320  021 


1  0100  20  310  011 
1  0120  22  310  333 

1  2211  01  01 0  110 

1  0301  00  230  111 

1  0020  02  200  320 

1  0001  20  100  010 

1  2200  00  010  110 

1  2000  10  000  001 

1  0000  00  200  000 

1  0100  00  210  011 

1  0001  20  000  120 

1  0020  02  000  322 

1  0001  20  100  010 

1  0100  00  210  011 

1  0021  22  100  322 

1  0100  00  210  011 

1  0101  20  310  011 


UNDERSTAND 

unfortunately 

unlikely 

up 

upon 

us 

use 

VARIES 

various 

Veda 

venture 

veriglability 

verification 

very 

view 

virgin 

visible 

vision 

vote 


0020  02  020  212 
3000  10  OCO  121 
0002  30  100  020 
3120  02  020  220 
0120  02  020  220 
0200  00  120  020 
0200  00  320  331 

0000  00  000  210 
0002  30  100  020 
0000  00  300  000 
0020  02  200  310 
0200  00  320  021 
0200  00  320  021 
1000  10  200  000 
0200  00  200  310 
0002  3  0  300  020 
0002  30  100  020 
0200  00  320  021 
0020  02  200  310 


0020  02  020  322 
2000  10  000  111 
0001  20  100  010 
2210  01  010  110 
0210  01  010  HO 
0100  00  210  010 
0100  00  210  331 

OOCO  00  000  220 
0001  20  100  010 
0000  00  200  000 
0020  02  200  320 
0100  00  210  011 
0200  00  310  011 
3000  10  100  000 
0100  00  300  320 
0001  30  300  010 
0001  20  100  010 
0100  00  210  011 
0020  02  200  320 


1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 


WAS 

watoh 

way 

ways 

we 

went 

what 

wheeled 

when 

where 


0020  02  000  312 
0220  02  320  333 
0000  00  320  021 
0200  00  320  021 
0000  00  100  001 
0020  02  020  212 
0210  01  001  011 
0002  30  100  230 
0001  01  011  001 
0001  01  011  001 


0020  02  000  222 
0120  02  310  333 
0000  00  210  011 
0100  00  210  011 
0000  00  200  001 
0020  02  020  322 
0110  01  002  011 
0001  20  100  320 
0002  01  012  001 
0002  01  012  001 


1 

1 

1 

1 

1 

1 

1 

1 

1 

1 


11. 


whether 

0000 

00 

001 

000 

1 

0000 

00 

002 

000 

whloh 

0200 

21 

121 

Oil 

1 

0100 

21 

112 

oil 

while 

0001 

01 

311 

001 

1 

0002 

01 

212 

001 

white 

0202 

30 

320 

021 

1 

0101 

20 

310 

on 

who 

0000 

01 

001 

001 

1 

0000 

01 

C02 

001 

whom 

0200 

01 

021 

010 

1 

0100 

01 

012 

010 

why 

0000 

01 

001 

001 

1 

0000 

01 

002 

001 

will 

0220 

02 

220 

333 

1 

0120 

02 

210 

232 

winds 

0200 

00 

300 

211 

1 

0100 

00 

200 

331 

with 

0122 

02 

020 

220 

1 

0211 

01 

010 

no 

without 

0122 

02 

020 

222 

1 

0211 

01 

010 

111 

works 

0220 

02 

320 

333 

1 

0120 

02 

310 

333 

would 

0020 

02 

000 

113 

1 

0020 

02 

000 

222 

would-be 

0000 

30 

100 

000 

1 

0000 

20 

100 

000 

written 

0000 

30 

100 

220 

1 

0000 

20 

100 

120 

wrong 

0202 

10 

300 

230 

1 

0101 

20 

300 

320 

wry 

0000 

30 

100 

000 

1 

0000 

20 

100 

000 

YEAR 

0000 

00 

300 

000 

1 

0000 

00 

200 

000 

yet 

2000 

00 

000 

121 

1 

2000 

00 

000 

111 

you 

0200 

00 

120 

021 

1 

0100 

00 

210 

Oil 

your 

0000 

00 

100 

000 

1 

0000 

00 

100 

000 

•RE 

0000 

00 

000 

100 

0 

0000 

00 

000 

200 

•s 

0200 

00 

000 

100 

0 

0200 

00 

000 

200 

1950 

0200 

00 

320 

021 

1 

0100 

00 

210 

Oil 

( 

0110 

01 

000 

000 

0 

0210 

01 

000 

000 

) 

0220 

02 

000 

000 

0 

0330 

03 

000 

000 

t 

2222 

22 

222 

222 

2 

3333 

33 

333 

333 

1 

0000 

00 

000 

002 

0 

0000 

00 

000 

003 

t 

0000 

00 

000 

001 

0 

0000 

00 

000 

003 

t 

. 

0000 

00 

000 

002 

0 

0000 

00 

000 

003 

? 

* 

ooco 

00 

000 

002 

0 

0000 

00 

000 

003 

• 

0000 

00 

000 

002 

0 

0000 

00 

000 

003 

Participation  Classes  of  Substituent  Typos 


Adverbial  Group 

OCOO 

30 

000 

121 

1 

0000 

10 

000 

111 

Adverbial  O-Clause 

Normal  Case 

3000 

30 

200 

021 

1 

2101 

10 

100 

Oil 

Relative  as  Dependent 

0000 

01 

220 

000 

1 

0000 

01 

110 

000 

Adverbial  S-Clause 

0200 

00 

020 

021 

1 

0100 

00 

010 

053 

Adverbial  Subjunot 

0000 

20 

200 

021 

1 

0000 

10 

100 

Oil 

Ad junotivo  Group 

0002 

00 

300 

021 

1 

0001 

00 

100 

Oil 

Adjunotive  3-Clause 

0000 

00 

200 

020 

1 

0000 

00 

100 

010 

Nominal  Group 

0222 

02 

020 

021 

1 

0111 

01 

010 

Oil 

Nominal  O-Clause 

Normal  Case 

0202 

20 

200 

021 

1 

0101 

10 

no 

Oil 

Relative  as  Dependent 

0000 

21 

000 

000 

1 

0000 

11 

000 

000 

Nominal  Subjunot 

0000 

00 

020 

021 

1 

0000 

00 

010 

Oil 

Operative  Group 

0020 

02 

000 

013 

1 

0020 

01 

000 

022 

Operative  3-Clause 

0020 

02 

020 

012 

1 

0020 

02 

020 

122 

Full  Clause 

0002 

00 

002 

003 

1 

0001 

00 

001 

001 

■*  *  a  "relative  dependent"  Is  an  Ad junotivo  S-Clause,  or 

any  substituent  itself  having  a  relative  dependent* 
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A  System  for  producing  English  Syntactic  Dictionary  Entries 

Tha  method  of  'sorting  frame  s’  set  out  here  has  a  limited 
purpose:  to  extend  the  Parker-Rhodes  Syntax  Dictionary  (PRSD) 
without  having  to  decide  the  Governor-Dependent  information 
entry  for  eaoh  word  digit  by  digit.  We  have  worked  within  the 
general  framework  of  the  PRSD  method  to  the  extent  that  we  have 
l  considered  the  12  substituent  types  (for  English  words)  to  be 
adequately  determined  by  the  list  of  examples  supplied  by  its 
author.  -4-  It  should  be  noticed  that  defects  in  the  original 
method  of  classifying  some  kinds  of  substituents  persist,  e.g. 
all  verb  groups  without  noun  objects  are  considered  to  braakst 
in  type  (10)qp  ,  hence  we  have  no  distinction  between  the 
functions  of  adjectives  and  past  participles  in  this  type, 

Easing  this  work  upon  the  examples  has  to  a  large  oxtent  made 
it  independent  of  the  PRSD  itself.  Thus  our  governor-dependent 
*  entries  for  words  occurring  in  the  PRSD  may  be  different  from 
those  to  be  found  there.  The  present  work  thus  checks 


-f-  For  the  list,  see  Appendix  I,  For  a  disoussion  of  the 
substituent  typos  themselves  soo  Parker-Rhodes*  et  ol,  "A 
Lattice  Model  of  Syntactic  Description'1,  M,L,  147,  "Supplement 
I,  to  J.I.L,  147",  M,L.  154  and  "How  to  Construct  a  Syntactic 
Dictionary  Entry",  M,L,  153, 

¥  The  twelve  substituent  typos  will  be  referred  to  in  this 
way  throughout. 
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existing  entries  as  well  as  producing  now  entries  on  a  large 
scale.  Experiments  will  show  whether  the  entries  produced 
by  the  present  method  bracket  bettor  than  the  previous  entries; 
the  method,  however,  as  opposed  to  the  original  entry  by  entry 
approach,  has  very  considerable  advantages:  onoe  words  have 
boon  assigned  to  parts  of  speech,  consistent  detailed 

entries  oan  be  allocated  in  a  quasi-automatic  way. 

General  Hothod 

The  general  method  is  to  sort  alphabetically  punched  word 
cards  into  heaps,  each  consisting  of  words  with  a  common 
governor-dependent  ontry;  this  entry  can  then  be  gang-punohed 
through  the  pack  from  a  master  oar’d. 

The  sorting  i3  done  by  asking  a  series  of  questions.  A,  which 
divide  the  words  into  five  classes  corresponding  crudely  to 
parts  of  speech.  Further  series  of  questions  are  then  asked 
for  each  of  these  classes,  and  the  answers  to  these  lead  to 
tables  whioh  give  the  required  governor-dependent  dictionary 
entrios.  One  question  in  A  rejects  words,  classified  as 
•fancy’,  which  will  require  individual  entries.  It  is  thought 
that  thoro  will  not  be  many  of  these. 
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Questions  A 
Question 

1.  Is  the  word  ’fancy1,  i.e. 

i)  subjunotion 
ii)  conjunction 
111)  part  of  the  word  "to  be" 
iv)  pronoun 
v)  auxiliary  verb? 

2.  Is  it  a  preposition? 

3.  Can  it  have  any  adverb  uses? 

4.  "  "  "  "  verb  " 

5.  "  »  "  "  noun  " 

5.  "  "  “  "  adjective  uses? 

At»  remaining  word  (e.g,,  inter¬ 
jection) 


If  the  answer  is  "yes" 


R  (rejeot) 
Sec  Tablo  III. 


it 

it 

m 

n 


IV. 

ii. 

i. 

v. 


R 


Table  I 


( 


Questions 

Answer 

1. 

Has  the  word'  noun  uses? 

Yes- 

nr 

No 

"1 

2. 

Has  it  adjective  uses? 

A 

A 

3. 

If  a'  verb,  can  it  stand  alone 
with  subject  but  without 
qualifier  or  object? 

I 

I 

4. 

Must  it  stand  (o "exists") 

14 

(R) 

5. 

./i.tihout  t,r.  object. 

If  an  adjective,  is  it  only  one 
if  procedod  by  an  adverb?  „  . 

(e.g.V  "bred") 

A  4 

(R) 

6. 

If  a  noun,  oan  it  be  used  without 
an  artlole  or  other  modifier? 

0 

G 

7. 

If  0,  oan  it  be  used  without  an 
artiolo  or  other  modifier  after 
a  preposition  in  type  (2a)? 

a 

0  — 
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Note  on  tha  ooding  of  adjectives 


A  word  is  an  adjeotive  if  it  can  qualify  any  noun  at  all,  even 

as  an  article.  If  it  can  only  function  thus  it  is  called  a 

'nounish'  (n)  adjective,  written  with  subscript,  A.  If  it  oan 

n 

also  plausibly  be  used  after  ’’when"  ©i*  "if"  (e.g.  "when  drunk 

I  drove  badly")  and  can  bo  qualified  by  an  adverb  (e.g.  "very 

drunk")  it  is  callod  a  ’plain*  (p)  adjective,  written  with 

superscript,  In  the  tables  alternative  readings  of  governor- 

dependent  entries  for  those  two  are  given  by  two  figures,  one 

above  the  other:  for  A  read  the  lower,  for  1  the  upper.  If 

n 

tho  adjective  is  noithcr  ’nounish'  or  ’plain'  the  word  is 
rejected  as  'fancy'. 

A  adjectives  can  in  general  be  used  to  qualify  gerunds  ("man 
eating  tiger")  whilo  £  cannot.  Noto  also:  for  gerunds  the 
criteria  for  adjectives  are  different: 


If  A  they  can  qualify  any  noun 
n 

P 

If  A  they  can  qualify  any  noun  and  be  qualified 

by  an  advSrE. 


Examples  a,  "Cypriot".  Ref.  table  I. 

P 

Coding:  NAG  (I/I  irrelevant) 

b,  "occurs".  Ref.  tebio  II,  (nullified  by 

reject  below) 
Coding:  NAT.-f ,  I-f-  e  R, 

(G/G  irrelevant) 


5. 


o.  "others",  Ref.  tnblo  I. 

Coding:  NAG  (I/I  irrelevant) 

Table  I 

Code  Plot Iona ry  Entry 

NAG  01  0^03  i  oj  11 


{ 


NAG 


NAG 


010000210011 


(  Oj  Ip  11 

oio  ;o3oo;Jo 
(!)  0  0  0  0 


(0) 

NAG  010000200000 

(0) 


Note  on  the  Code  of  G- 


Alternative  readincs  for  G-  are  given  in  brackets;  there  will 
always  bo  two  such  entries,  to  account  for  both  <n*  and 
adjectives,  even  where  the  two  entries  aro  the  same*  Thus  the 
word  "hospital",  in  Table  I,  will  have  the  coding  NAG-  andnthe 
entry  0100  0030  0000,  that  is,  the  lower  alternative  for 

the  ooding  NAG* 


f 


Answer 


6. 

Table  II 
Questions 

1-7.  ASK  THE  QUESTIONS  GIVEN  UNDER  TABLE  I 


8.  Is  the  verb  a  present  tonso  first  VI 

person  singular  form? 

9.  Is  it  a  present  tense  third  person  V2 

singular? 

1  10,  Is  it  a  past  tense  third  person  V3 

singular? 

11,  Is  it  a  past  tense  and  past  V3l 

participle? 

12,  Is  It  a  post  participle  only?  V4 

13,  Is  it  a  gerund?  V5 


#-  l.o.  of  the 

samo  verb. 


Table  II 


Code 

VI 

V2 

V3 

NAIG 

12 

010oc2310333 

12  3 

0120023l0g33 

NAIG 

(0)l  ?  3 

010^23  0C3qR 
(l)00  2 

(Ol2  33 

012002300222 

NAIG 

010^0310331 
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Table 

II  - 

continued 

MIG 

(0)l2  3 

010^03003^0 
(D°0  2 

MIG 

010002210333 

012002210233 

MIG 

(0) 

010002200322 

(0) 

MIG 

010000210331 

MIG 

(0) 

010000200320 

(0) 

mi 

12  3 

0000021003g2 

MI 

000^01003|0 

12  33 

000oo0100s2° 

MI 

000002000322 

002002000222 

MI 

000000000320 

000 000000220 

Table 

II  - 

continued 

Code 

V3 1 

V4 

V5 

MIG 

docs  not  occur 

docs  not  occur 

MIG 

n  ii  it 

ii  it  it 

MIG 

03010*0330111 

8. 


Table  II 

-  continued 

NAIG 

doos  not  oocur 

NAIG 

does 

not 

occur 

n  n  ti 

NAIG 

ti 

ti 

it 

n  it  it 

NAIG 

030000230111 

NAIG 

doos  not  oocur 

NAI 

002qq21003|2 

docs 

not 

occur 

does  not  oocur 

NAI 

000^01000^0 

2* 

0201Q  0120110 

mi 

002002000322 

docs 

not 

oocur 

does  not  occur 

NAI 

OOOOC 0000320 

OOOOOOOOOIOO 

020000020110 

Note 

Only  cases  whloh  havo  actually  occurred  In  the  dictionary 

sample  to  dato  have  been  considered.  Blanks  will  be  filled  In 

when  necessary,  Casas  which  cannot  occur  aro  indicated, 

^  N,B.  For  gerunds  which  arc  adjootivos  see  the  Note  on  the 
ooding  of  adjootivos  as  ’p’  and  ’n’  above. 

Assumptions  for  all  gorunds. 

If  they  are  nouns  they  are  ooded  G,  if  verbs  ore  coded  I;  they 
all  depend  in  type  (4),  do  not  depend  in  type  (2b),  and  depend 
in  type  (11)  (save  I*  gerunds  like  "occurring"). 
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Table  III  (Propositions) 

BASIC  ENTRY:  2200  00010110 

(adjustments)  N  V  A  A  V  N  V  V  N 

A  V 


Assumptions: 

All  prepositions  aro  adverbs. 
No  propositions  are  gerunds. 


Table  IV  (Adverbs) 

Additional  quostlon 

Can  it  modify  other  advorbs?  Yos  s  3  in  (1). 

EASIC  ENTRY:  2000  1000  0110 

(adjustments)  N  V  A  A  V  N  N  VVN 

A  V 


Assumption: 

All  adverbs  oan  qualify  adjectives. 


Tablo  V  (Ad.lootlves  without  noun  or  vorb  us  os) 

Thoy  ore  all  ’pinin’  adjectives. 

BASIC  ENTRY:  0001  2010  0110 

Note  on  Tables  III,  IV  and  V 

For  theso  no  coding  is  used.  For  Tobies  III  and  IV  basic 
entries  ere  givon  which  arc  modified  if  the  word  has  In 
addition  noun,  vorb  or  adjective  functions.  Tho  letters  If,V, A 
aro  marked  undor  tho  rolovont  columns  for  adjustment  in  such 
oasos  according  to  Tablo s  I,  II,  and  V. 


EXAMP  IE 3 


10. 


1)  "arm"  Tablo  II  s  0100  0230  0322 

oodo  VI/  N£IG 

PRSD  ontry  is  002002200220 

whioh  is  wrong  in  oach  of 'tho  four  positions  in  which 
it  diffors  from  tho  table. 


2)  "agrioulturo"  Tablo  1  *  0100  0021  0011 

oodo  NAG 

PRSD  entry  is  0100  0021  0011 


3)  "of"  Tablo  III  *2200  00010110 

(simple,  no  N,V,A  functions) 

PH3D  entry  is  0210  01010110 

whioh  is  wrong  in  oaoh  of ‘tho  four  positions  in  which 
it  diffors  from  tho  tablo . 


APPENDIX  I 


ENGLISH  EXAMP  IE 3  PCR  THE  SYNTAX  DICTIONARY 


Tho  intorlingual  name  is  given  first,  the  English  namo  sooond. 
The  governor  is  underlined. 


0,  Conjunct  Group  a  CONJUNCT  GROUP 

boforc  tea  and  eftorwards 
excellent  and  of  good  report 
men,  women  and  children 
soak  or  rinse  tho  clothes 
I  carac“but  found  no-one 


1.  Adverbial  Group  e  ADVERBIAL  GROUP 

rather  !  nloolv 
exactly  i  in  the  middle 
not  |  at  all  . 

excessively  j in  my  opinion 


2.  Adverbial  O-Clauso  ■  PREPOSITIONAL  PHRASE  CR 

PARTICIPLE  CLAUSE 

A)  in  jdobt 

From  (whom  (with  rolntivo  pronoun 

“  as  dependent) 

among  j  the  savage  tribos 
ou't " of '  j  jail 

B)  thinking  |hrppy  thoughts  (all  these  participial 

man  |  eating  [tigers  “]  clauses  can  also  bo  used 

accompanying  T  her  husband  as  infinitive  (gerund) 
- — '  clausos) 

3,  Adverbial  S-Clnuso  «  ABSOLUTE  CLATBE 

whenever  1 1  go  [he  *b  out]  (all  those  have  also 

n  don’t  c are”]  what  you  do  noun  function)  (Wwhatu 
fTowovorl  I  try  tl  faiajj  is  exceptional,  as"  It  al« 

QC  shan’t  do  ltj  whoover|  says  so  ways  mokes  a  noun 

clausoj  but  it  is 
still  counted  in  this 
typo) 


Adverbial  Sub.lunct  s  ADVERBIAL  CIA  USE 


if  |  pigs  had  wings  | 

[TSon’t  drivQ~)  when  i drunk 
when  |  my  ship  comes  homo 
Tn  order  that  |  they  should 


(ad jeotivo 
as 

try  harder  dependent) 


Ad.lunotlvo  Group  a  ADJECTIVAL  GROUP 

very  I  pretty 
larger  T than  life 

tunotual  jto  tho  minu^o 
oavy  and  gross  I  beyond  boliof 
almost  |  black  |  on  the  sides 


(two  depen¬ 
dents  hero) 


Ad.lunotlvo  S-Clauso  -  RELATIVE  CLAUSE 


who  )  state  and  stare 
'  tha  t  i  pass  In  the  night 
I  have  known 


(rclativo  pro¬ 
noun  zoro) 


Nominal  Group  a  NOUN  GROUP 
rather  horrid  old  j  men 


long  j  word 3  | which  no  one  can  follow 

little  pitohors 
the  j  moon 


(two  dopen- 
donts) 
(two  depen¬ 
dents  ) 


Nominal  O-Olauso 


INFINITIVE  (OR  GERUND)  CLAUSE 


1  .  ■_  _ 

to  mako  !  dictionaries  fTs  oxhaustingj 
thinking  I  happy  thoughts  [is  funj 
novor  to  ha vo  made  |  mistakes  Q.s  not 
to  sudd  on?  y  stop  T  talking  Qs  "rudCl 
JTT  TIT  "tHatTTo_  ‘make  j  you  laugh 

jj3 omo th lngj  to  mako  j  you  laugh 

pa  buxT~j  to  oat  J  whioh  [is  risky  J 


d 


to  have  loarn 


( oxamplo  of 
adverbial  uso) 
(oxamplo  of 
adjectival  uso) 
(with  relative  pro¬ 
noun  as  depondont) 


9.  Nominal  Subjunot  -  NOMINAL  CLAUSE 

(jf  foundJ  that  jthoy  woro  equal 
!  she  askodn  v/Hon  I  tho  7  would  arrive 
1*1  wendor^wEotHor  that  is  truo 

j=I  thlrilc|7.  |  you're  a  fool  (governor  zoro) 


t. 


10,  Opera tivo  Group 
LlJ  ha vo  loomo 


VERB  GROUP 


jjTho  EouscT]  will  ha  vo  boon  bolng 
rThcyl  soon  Tdl'fl 

po~|Porhapa'|Blijit  havo  [already 

HEhalH  may 
[~Shc_]  (too 

LThat 3  Is  labsurd 
flTou  (roolly  i  look  |  ridiculous 


nevor  happen 
often)  [forgets 


(mainvorb  as_dopendont ) 
built  jtwo  wo e ks] 

(advorb  as  dependent) 
arrived  (three  depend¬ 
ents  ) 

(two  dependents) 
(complcto  verb  as 
governor) 

(to  bo  with  ad jo o tivo) 
(comploto  verb  with 
adverb  and  adjective  as 
two  dopondonts) 


( 


11,  Operative  S -Clause 


PREDICATE 


liko  Jit 


I  Thoj]  _ 

i~5ho  J  lovos  fhlm 
fl  Toft'en' tried  to  cross  | tho  little  river 
fTfc  ;  gave  I  me  some  books 
j^fouj  sKould  paint  j  tho  door  whito 


That  noods  doing 


(two  dopondonts) 
(two  dependents,  one 
an  ndjootivc) 


12 .  Full  Clauso  ■  FULL  CLAUSE 

It  j  is  reining  , 

It  is  difficult  to  do  that 


People  !  like  children 
I  Go  to  He  11  *” 


C5ho«s  thoroO  Mo  • , 


Tho  aoro 


the  morrlor 


(note  "it"  of  concord 
boforo  infinitive 
clauso) 

(omission  of  dependent 
makes  imperative ) 
(omisdon  of  govomor) 
(anomalous  type  :  the 
sooond  half  may  be  re¬ 
garded  as  a  predicate 
with  zoro  verb) 
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d  f4P 
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O  P 
bo  to  H 
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■rl  p  w 
p  p  o 
cl  o  bo 
bO^i  d  O 
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